Dataset statistics
| Number of variables | 36 |
|---|---|
| Number of observations | 78032 |
| Missing cells | 647063 |
| Missing cells (%) | 23.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 20.9 MiB |
| Average record size in memory | 281.0 B |
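The missing-cell percentage can be cross-checked from the counts in the table above. The following is a minimal sketch of that arithmetic; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 36
n_observations = 78032
missing_cells = 647063

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

The result agrees with the 23.0% reported for missing cells.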
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 23 |
| Boolean | 2 |
Alerts
| Alert | Type |
|---|---|
| Name has a high cardinality: 22710 distinct values | High cardinality |
| Address has a high cardinality: 6618 distinct values | High cardinality |
| StreetName has a high cardinality: 669 distinct values | High cardinality |
| BldgNo has a high cardinality: 94 distinct values | High cardinality |
| UnitNo has a high cardinality: 3322 distinct values | High cardinality |
| PostalCode has a high cardinality: 2901 distinct values | High cardinality |
| Location has a high cardinality: 56 distinct values | High cardinality |
| NAICSDescr has a high cardinality: 1039 distinct values | High cardinality |
| Phone has a high cardinality: 25064 distinct values | High cardinality |
| Fax has a high cardinality: 15752 distinct values | High cardinality |
| TollFree has a high cardinality: 4117 distinct values | High cardinality |
| EMail has a high cardinality: 15058 distinct values | High cardinality |
| WebAddress has a high cardinality: 14200 distinct values | High cardinality |
| EmplUpdate has a high cardinality: 433 distinct values | High cardinality |
| Character has a high cardinality: 56 distinct values | High cardinality |
| CHArea has a high cardinality: 57 distinct values | High cardinality |
| Modified has a high cardinality: 189 distinct values | High cardinality |
| X is highly overall correlated with Y and 2 other fields | High correlation |
| Y is highly overall correlated with X and 2 other fields | High correlation |
| BusinessID is highly overall correlated with FID and 2 other fields | High correlation |
| Ward is highly overall correlated with FID and 8 other fields | High correlation |
| CENT_X is highly overall correlated with Location and 2 other fields | High correlation |
| CENT_Y is highly overall correlated with Location and 2 other fields | High correlation |
| Year is highly overall correlated with X and 3 other fields | High correlation |
| RecordID is highly overall correlated with FID and 2 other fields | High correlation |
| isnew is highly overall correlated with X and 2 other fields | High correlation |
| CHArea is highly overall correlated with FID and 6 other fields | High correlation |
| Character is highly overall correlated with FID and 4 other fields | High correlation |
| Sector_Des is highly overall correlated with NAICSCat | High correlation |
| BIAFulName is highly overall correlated with FID and 3 other fields | High correlation |
| BIA_NAME is highly overall correlated with FID and 3 other fields | High correlation |
| Closed is highly overall correlated with BIAFulName and 1 other field | High correlation |
| FID is highly overall correlated with BusinessID and 8 other fields | High correlation |
| BldgNo is highly overall correlated with Location and 2 other fields | High correlation |
| Location is highly overall correlated with FID and 7 other fields | High correlation |
| NAICSCode is highly overall correlated with NAICSCat | High correlation |
| NAICSCat is highly overall correlated with Location and 5 other fields | High correlation |
| PIN is highly overall correlated with FID and 3 other fields | High correlation |
| X has 48605 (62.3%) missing values | Missing |
| Y has 48605 (62.3%) missing values | Missing |
| Location has 47693 (61.1%) missing values | Missing |
| EmplUpdate has 15002 (19.2%) missing values | Missing |
| Sector_Des has 63430 (81.3%) missing values | Missing |
| CENT_X has 47693 (61.1%) missing values | Missing |
| CENT_Y has 47693 (61.1%) missing values | Missing |
| PIN has 30339 (38.9%) missing values | Missing |
| Character has 61682 (79.0%) missing values | Missing |
| CHArea has 46689 (59.8%) missing values | Missing |
| Modified has 63217 (81.0%) missing values | Missing |
| BIA_NAME has 63207 (81.0%) missing values | Missing |
| BIAFulName has 63207 (81.0%) missing values | Missing |
| StreetNo is highly skewed (γ₁ = 147.6524357) | Skewed |
Reproduction
| Analysis started | 2023-03-04 22:45:32.820108 |
|---|---|
| Analysis finished | 2023-03-04 22:46:38.756839 |
| Duration | 1 minute and 5.94 seconds |
| Software version | pandas-profiling v3.5.0 |
| Download configuration | config.json |
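The reported duration can be checked against the two timestamps in the table above. This is a small sketch using only the standard library; the format string is an assumption based on how the timestamps are printed.

```python
from datetime import datetime

# Assumed format, matching the timestamps as printed in the table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-03-04 22:45:32.820108", FMT)
finished = datetime.strptime("2023-03-04 22:46:38.756839", FMT)

# Elapsed wall-clock time, split into whole minutes and remaining seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)

print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```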
X
Real number (ℝ)
| Distinct | 8684 |
|---|---|
| Distinct (%) | 29.5% |
| Missing | 48605 |
| Missing (%) | 62.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 306553.47 |
| Minimum | -79.80298 |
|---|---|
| Maximum | 617060.11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 14602 |
| Negative (%) | 18.7% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | -79.80298 |
|---|---|
| 5-th percentile | -79.716419 |
| Q1 | -79.64992 |
| median | 598535.65 |
| Q3 | 608829.52 |
| 95-th percentile | 613567.3 |
| Maximum | 617060.11 |
| Range | 617139.91 |
| Interquartile range (IQR) | 608909.17 |
Descriptive statistics
| Standard deviation | 304335.28 |
|---|---|
| Coefficient of variation (CV) | 0.99276409 |
| Kurtosis | -1.9996012 |
| Mean | 306553.47 |
| Median Absolute Deviation (MAD) | 17202.025 |
| Skewness | -0.014922506 |
| Sum | 9.0209489 × 10⁹ |
| Variance | 9.261996 × 10¹⁰ |
| Monotonicity | Not monotonic |
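Several of the figures above are derived from others in the same tables. The sketch below recomputes the range, the interquartile range, and the coefficient of variation from the reported values; the variable names are illustrative only.

```python
# Values copied from the quantile and descriptive tables of this variable.
minimum, maximum = -79.80298, 617060.11
q1, q3 = -79.64992, 608829.52
mean, std = 306553.47, 304335.28

value_range = maximum - minimum   # spread between the extremes
iqr = q3 - q1                     # spread of the middle 50% of values
cv = std / mean                   # standard deviation relative to the mean

print(f"range={value_range:.2f} iqr={iqr:.2f} cv={cv:.4f}")
```

Up to rounding of the inputs, the results match the Range, IQR, and CV rows reported above.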
Common Values
| Value | Count | Frequency (%) |
| 609566.1112 | 201 | 0.3% |
| -79.64275968 | 185 | 0.2% |
| -79.60364656 | 123 | 0.2% |
| 607701.737 | 121 | 0.2% |
| -79.71222857 | 113 | 0.1% |
| -79.63864759 | 107 | 0.1% |
| 604057.4854 | 101 | 0.1% |
| 609718.3353 | 100 | 0.1% |
| -79.56936408 | 91 | 0.1% |
| 615498.4771 | 66 | 0.1% |
| Other values (8674) | 28219 | |
| (Missing) | 48605 |
Minimum 10 values
| Value | Count | Frequency (%) |
| -79.80298035 | 1 | < 0.1% |
| -79.8014612 | 1 | < 0.1% |
| -79.79447393 | 1 | < 0.1% |
| -79.79439767 | 1 | < 0.1% |
| -79.78884298 | 1 | < 0.1% |
| -79.78871792 | 20 | |
| -79.78850259 | 1 | < 0.1% |
| -79.78675536 | 5 | < 0.1% |
| -79.78630211 | 12 | |
| -79.78452433 | 11 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 617060.1055 | 1 | |
| 616918.4738 | 1 | |
| 616839.6893 | 1 | |
| 616837.5953 | 1 | |
| 616769.3441 | 1 | |
| 616704.5391 | 1 | |
| 616692.2284 | 1 | |
| 616667.6043 | 1 | |
| 616657.8816 | 1 | |
| 616643.3766 | 1 |
Y
Real number (ℝ)
| Distinct | 8684 |
|---|---|
| Distinct (%) | 29.5% |
| Missing | 48605 |
| Missing (%) | 62.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2433290.7 |
| Minimum | 43.48517 |
|---|---|
| Maximum | 4843106.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 43.48517 |
|---|---|
| 5-th percentile | 43.53859 |
| Q1 | 43.608514 |
| median | 4818092 |
| Q3 | 4829966.3 |
| 95-th percentile | 4838021.6 |
| Maximum | 4843106.9 |
| Range | 4843063.4 |
| Interquartile range (IQR) | 4829922.6 |
Descriptive statistics
| Standard deviation | 2414921.5 |
|---|---|
| Coefficient of variation (CV) | 0.99245088 |
| Kurtosis | -1.9998953 |
| Mean | 2433290.7 |
| Median Absolute Deviation (MAD) | 23561.033 |
| Skewness | -0.015148997 |
| Sum | 7.1604446 × 10¹⁰ |
| Variance | 5.8318459 × 10¹² |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 4827535.97 | 201 | 0.3% |
| 43.59351505 | 185 | 0.2% |
| 43.67999884 | 123 | 0.2% |
| 4838234.833 | 121 | 0.2% |
| 43.55837136 | 113 | 0.1% |
| 43.72011759 | 107 | 0.1% |
| 4823601.861 | 101 | 0.1% |
| 4841653.08 | 100 | 0.1% |
| 43.5935916 | 91 | 0.1% |
| 4827677.175 | 66 | 0.1% |
| Other values (8674) | 28219 | |
| (Missing) | 48605 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 43.48517014 | 1 | |
| 43.48968489 | 1 | |
| 43.4915708 | 1 | |
| 43.49199992 | 2 | |
| 43.49224252 | 1 | |
| 43.49454092 | 1 | |
| 43.49517064 | 1 | |
| 43.49608236 | 1 | |
| 43.49636475 | 1 | |
| 43.49652992 | 2 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 4843106.933 | 3 | |
| 4843045.912 | 1 | < 0.1% |
| 4842995.781 | 2 | |
| 4842852.901 | 1 | < 0.1% |
| 4842722.486 | 1 | < 0.1% |
| 4842531.982 | 2 | |
| 4842304.058 | 2 | |
| 4842274.717 | 1 | < 0.1% |
| 4842274.399 | 2 | |
| 4842200.556 | 2 |
FID
Real number (ℝ)
| Distinct | 16518 |
|---|---|
| Distinct (%) | 21.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7823.2043 |
| Minimum | 1 |
|---|---|
| Maximum | 16518 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 781 |
| Q1 | 3902 |
| median | 7804 |
| Q3 | 11705.25 |
| 95-th percentile | 14902 |
| Maximum | 16518 |
| Range | 16517 |
| Interquartile range (IQR) | 7803.25 |
Descriptive statistics
| Standard deviation | 4538.5029 |
|---|---|
| Coefficient of variation (CV) | 0.58013351 |
| Kurtosis | -1.1665353 |
| Mean | 7823.2043 |
| Median Absolute Deviation (MAD) | 3902 |
| Skewness | 0.024756244 |
| Sum | 6.1046028 × 10⁸ |
| Variance | 20598009 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 1 | 5 | < 0.1% |
| 9727 | 5 | < 0.1% |
| 9729 | 5 | < 0.1% |
| 9730 | 5 | < 0.1% |
| 9731 | 5 | < 0.1% |
| 9732 | 5 | < 0.1% |
| 9733 | 5 | < 0.1% |
| 9734 | 5 | < 0.1% |
| 9735 | 5 | < 0.1% |
| 9736 | 5 | < 0.1% |
| Other values (16508) | 77982 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 2 | 5 | |
| 3 | 5 | |
| 4 | 5 | |
| 5 | 5 | |
| 6 | 5 | |
| 7 | 5 | |
| 8 | 5 | |
| 9 | 5 | |
| 10 | 5 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 16518 | 1 | |
| 16517 | 1 | |
| 16516 | 1 | |
| 16515 | 1 | |
| 16514 | 1 | |
| 16513 | 1 | |
| 16512 | 1 | |
| 16511 | 1 | |
| 16510 | 1 | |
| 16509 | 1 |
BusinessID
Real number (ℝ)
| Distinct | 21240 |
|---|---|
| Distinct (%) | 27.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34656.267 |
| Minimum | 2 |
|---|---|
| Maximum | 94424 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2230 |
| Q1 | 9764 |
| median | 19182.5 |
| Q3 | 55026 |
| 95-th percentile | 88915 |
| Maximum | 94424 |
| Range | 94422 |
| Interquartile range (IQR) | 45262 |
Descriptive statistics
| Standard deviation | 29857.312 |
|---|---|
| Coefficient of variation (CV) | 0.86152708 |
| Kurtosis | -0.99364033 |
| Mean | 34656.267 |
| Median Absolute Deviation (MAD) | 16019.5 |
| Skewness | 0.65057392 |
| Sum | 2.7042978 × 10⁹ |
| Variance | 8.9145909 × 10⁸ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 1055 | 5 | < 0.1% |
| 20882 | 5 | < 0.1% |
| 19580 | 5 | < 0.1% |
| 20871 | 5 | < 0.1% |
| 19831 | 5 | < 0.1% |
| 19332 | 5 | < 0.1% |
| 19583 | 5 | < 0.1% |
| 19832 | 5 | < 0.1% |
| 19584 | 5 | < 0.1% |
| 20872 | 5 | < 0.1% |
| Other values (21230) | 77982 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 2 | 2 | < 0.1% |
| 7 | 5 | |
| 10 | 5 | |
| 12 | 3 | |
| 16 | 5 | |
| 18 | 5 | |
| 20 | 5 | |
| 21 | 5 | |
| 23 | 5 | |
| 26 | 4 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 94424 | 1 | |
| 94423 | 1 | |
| 94419 | 1 | |
| 94371 | 1 | |
| 94321 | 1 | |
| 94319 | 1 | |
| 94318 | 1 | |
| 94317 | 1 | |
| 94313 | 1 | |
| 94293 | 1 |
Name
Categorical
| Distinct | 22710 |
|---|---|
| Distinct (%) | 29.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| Subway | 212 |
|---|---|
| Tim Hortons | 181 |
| Petro Canada | 123 |
| Shoppers Drug Mart | 102 |
| Tim Horton's | 97 |
| Other values (22705) |
Length
| Max length | 118 |
|---|---|
| Median length | 76 |
| Mean length | 22.654539 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1767779 |
|---|---|
| Distinct characters | 93 |
| Distinct categories | 15 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 5010 |
|---|---|
| Unique (%) | 6.4% |
Sample
| 1st row | Golf Trends Inc. |
|---|---|
| 2nd row | Apex Graphics Inc. |
| 3rd row | Sands, John & Associates Limited |
| 4th row | Printmedia-Tackaberry Times |
| 5th row | S W R Industries Ltd. |
Common Values
| Value | Count | Frequency (%) |
| Subway | 212 | 0.3% |
| Tim Hortons | 181 | 0.2% |
| Petro Canada | 123 | 0.2% |
| Shoppers Drug Mart | 102 | 0.1% |
| Tim Horton's | 97 | 0.1% |
| PLASP Child Care Centre | 96 | 0.1% |
| Dollarama | 92 | 0.1% |
| Starbucks | 88 | 0.1% |
| Shell Canada | 84 | 0.1% |
| Royal Bank of Canada | 78 | 0.1% |
| Other values (22700) | 76879 |
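The table above lists "Tim Hortons" and "Tim Horton's" as separate values, which suggests that spelling variants inflate the distinct count. The sketch below shows one way such variants might be merged before counting. The `normalize_name` helper and its rules (case folding, dropping apostrophes, collapsing whitespace) are hypothetical and not part of the report.

```python
import re
from collections import Counter

def normalize_name(name: str) -> str:
    """Hypothetical helper: fold case, drop apostrophes, collapse spaces."""
    name = name.casefold()
    name = re.sub(r"['’]", "", name)          # straight and curly apostrophes
    return re.sub(r"\s+", " ", name).strip()

# Counts copied from the Common Values table above.
counts = {"Tim Hortons": 181, "Tim Horton's": 97, "Subway": 212}

# Sum the counts of all spellings that normalize to the same key.
merged = Counter()
for name, count in counts.items():
    merged[normalize_name(name)] += count

print(merged.most_common(2))
```

Under these assumed rules the two spellings collapse into a single key whose combined count exceeds that of "Subway".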
Words
| Value | Count | Frequency (%) |
| inc | 15794 | 5.7% |
| (empty) | 9127 | 3.3% |
| ltd | 7946 | 2.9% |
| canada | 4795 | 1.7% |
| centre | 2969 | 1.1% |
| and | 2598 | 0.9% |
| services | 2443 | 0.9% |
| the | 2359 | 0.8% |
| a | 2092 | 0.8% |
| of | 2044 | 0.7% |
| Other values (16113) | 225478 |
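The table above counts lowercase tokens across all values of the column. The report does not state how the tokens are split, so the sketch below assumes a simple rule (lowercase, then keep runs of letters and digits) and applies it to the five sample rows shown earlier for this variable.

```python
import re
from collections import Counter

# The five sample rows shown for this variable.
names = [
    "Golf Trends Inc.",
    "Apex Graphics Inc.",
    "Sands, John & Associates Limited",
    "Printmedia-Tackaberry Times",
    "S W R Industries Ltd.",
]

def tokenize(text: str) -> list[str]:
    # Assumed rule: lowercase, then keep runs of ASCII letters and digits.
    return re.findall(r"[a-z0-9]+", text.lower())

# Count every token across all names.
word_counts = Counter(word for name in names for word in tokenize(name))
print(word_counts.most_common(3))
```

Even in this small sample "inc" is the most frequent token, in line with its position at the top of the table.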
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 199927 | 11.3% |
| e | 132589 | 7.5% |
| a | 128136 | 7.2% |
| n | 115216 | 6.5% |
| i | 104250 | 5.9% |
| r | 101893 | 5.8% |
| o | 97613 | 5.5% |
| t | 94807 | 5.4% |
| s | 77470 | 4.4% |
| l | 62777 | 3.6% |
| Other values (83) | 653101 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1236769 | |
| Uppercase Letter | 275469 | 15.6% |
| Space Separator | 199927 | 11.3% |
| Other Punctuation | 44368 | 2.5% |
| Decimal Number | 4222 | 0.2% |
| Dash Punctuation | 4194 | 0.2% |
| Close Punctuation | 1272 | 0.1% |
| Open Punctuation | 1266 | 0.1% |
| Math Symbol | 178 | < 0.1% |
| Final Punctuation | 99 | < 0.1% |
| Other values (5) | 15 | < 0.1% |
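The category names in the table above correspond to Unicode general categories. As an illustration, the sketch below counts them for a single string with the standard `unicodedata` module, which returns two-letter codes such as `Lu` (Uppercase Letter), `Ll` (Lowercase Letter), `Zs` (Space Separator), and `Po` (Other Punctuation). The `category_counts` helper is a hypothetical name.

```python
import unicodedata
from collections import Counter

def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

counts = category_counts("Tim Horton's")
print(counts)
```

Applied to every value of a column and summed, the same counting would yield a per-category table like the one above.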
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 132589 | |
| a | 128136 | |
| n | 115216 | |
| i | 104250 | 8.4% |
| r | 101893 | 8.2% |
| o | 97613 | 7.9% |
| t | 94807 | 7.7% |
| s | 77470 | 6.3% |
| l | 62777 | 5.1% |
| c | 60202 | 4.9% |
| Other values (20) | 261816 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 35962 | |
| S | 28667 | 10.4% |
| I | 23883 | 8.7% |
| M | 18395 | 6.7% |
| L | 18128 | 6.6% |
| A | 17083 | 6.2% |
| P | 16975 | 6.2% |
| T | 15559 | 5.6% |
| D | 13515 | 4.9% |
| B | 11145 | 4.0% |
| Other values (17) | 76157 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 29521 | |
| & | 7166 | 16.2% |
| , | 3463 | 7.8% |
| ' | 3108 | 7.0% |
| / | 898 | 2.0% |
| : | 88 | 0.2% |
| # | 35 | 0.1% |
| @ | 29 | 0.1% |
| ! | 26 | 0.1% |
| " | 16 | < 0.1% |
| Other values (2) | 18 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 906 | |
| 2 | 760 | |
| 0 | 712 | |
| 4 | 418 | |
| 3 | 334 | 7.9% |
| 9 | 287 | 6.8% |
| 8 | 245 | 5.8% |
| 7 | 197 | 4.7% |
| 5 | 184 | 4.4% |
| 6 | 179 | 4.2% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 152 | |
| \| | 25 | 14.0% |
| > | 1 | 0.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1264 | |
| ] | 8 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 199927 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4194 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1266 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 99 |
Control
| Value | Count | Frequency (%) |
| (control character) | 6 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Format
| Value | Count | Frequency (%) |
| (format character) | 3 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 2 |
Other Symbol
| Value | Count | Frequency (%) |
| © | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1512238 | |
| Common | 255541 | 14.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 132589 | 8.8% |
| a | 128136 | 8.5% |
| n | 115216 | 7.6% |
| i | 104250 | 6.9% |
| r | 101893 | 6.7% |
| o | 97613 | 6.5% |
| t | 94807 | 6.3% |
| s | 77470 | 5.1% |
| l | 62777 | 4.2% |
| c | 60202 | 4.0% |
| Other values (47) | 537285 |
Common
| Value | Count | Frequency (%) |
| (space) | 199927 | |
| . | 29521 | 11.6% |
| & | 7166 | 2.8% |
| - | 4194 | 1.6% |
| , | 3463 | 1.4% |
| ' | 3108 | 1.2% |
| ( | 1266 | 0.5% |
| ) | 1264 | 0.5% |
| 1 | 906 | 0.4% |
| / | 898 | 0.4% |
| Other values (26) | 3828 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1767601 | |
| Punctuation | 102 | < 0.1% |
| None | 76 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 199927 | 11.3% |
| e | 132589 | 7.5% |
| a | 128136 | 7.2% |
| n | 115216 | 6.5% |
| i | 104250 | 5.9% |
| r | 101893 | 5.8% |
| o | 97613 | 5.5% |
| t | 94807 | 5.4% |
| s | 77470 | 4.4% |
| l | 62777 | 3.6% |
| Other values (75) | 652923 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 99 | |
| (format character) | 3 | 2.9% |
None
| Value | Count | Frequency (%) |
| é | 67 | |
| ü | 4 | 5.3% |
| ē | 2 | 2.6% |
| É | 1 | 1.3% |
| ä | 1 | 1.3% |
| © | 1 | 1.3% |
Address
Categorical
| Distinct | 6618 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 100 City Centre Dr | 953 |
|---|---|
| 5100 Erin Mills Pky | 523 |
| 7205 Goreway Dr | 483 |
| 1250 South Service Rd | 394 |
| 1550 South Gateway Rd | 284 |
| Other values (6613) |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 16.625525 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1297323 |
|---|---|
| Distinct characters | 64 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 292 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 300 Ambassador Dr |
|---|---|
| 2nd row | 320 Ambassador Dr |
| 3rd row | 320 Ambassador Dr |
| 4th row | 320 Ambassador Dr |
| 5th row | 321 Ambassador Dr |
Common Values
| Value | Count | Frequency (%) |
| 100 City Centre Dr | 953 | 1.2% |
| 5100 Erin Mills Pky | 523 | 0.7% |
| 7205 Goreway Dr | 483 | 0.6% |
| 1250 South Service Rd | 394 | 0.5% |
| 1550 South Gateway Rd | 284 | 0.4% |
| 4141 Dixie Rd | 248 | 0.3% |
| 2225 Erin Mills Pky | 238 | 0.3% |
| 50 Burnhamthorpe Rd W | 229 | 0.3% |
| 2355 Derry Rd E | 212 | 0.3% |
| 2000 Credit Valley Rd | 212 | 0.3% |
| Other values (6608) | 74256 |
Words
| Value | Count | Frequency (%) |
| rd | 28597 | 10.8% |
| dr | 17907 | 6.8% |
| e | 12047 | 4.6% |
| st | 9954 | 3.8% |
| blvd | 8013 | 3.0% |
| w | 7245 | 2.7% |
| dundas | 4805 | 1.8% |
| ave | 3977 | 1.5% |
| matheson | 2625 | 1.0% |
| pky | 2579 | 1.0% |
| Other values (3761) | 165836 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 185556 | 14.3% |
| r | 77071 | 5.9% |
| e | 71979 | 5.5% |
| a | 58783 | 4.5% |
| d | 55945 | 4.3% |
| 0 | 51078 | 3.9% |
| n | 49722 | 3.8% |
| 5 | 48031 | 3.7% |
| t | 47992 | 3.7% |
| i | 45039 | 3.5% |
| Other values (54) | 606127 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 636946 | |
| Decimal Number | 287140 | |
| Uppercase Letter | 187144 | 14.4% |
| Space Separator | 185556 | 14.3% |
| Dash Punctuation | 480 | < 0.1% |
| Other Punctuation | 54 | < 0.1% |
| Modifier Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 77071 | |
| e | 71979 | |
| a | 58783 | |
| d | 55945 | |
| n | 49722 | 7.8% |
| t | 47992 | 7.5% |
| i | 45039 | 7.1% |
| o | 36413 | 5.7% |
| l | 32505 | 5.1% |
| s | 27700 | 4.3% |
| Other values (15) | 133797 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 31751 | |
| D | 29023 | |
| S | 18789 | |
| E | 16442 | |
| B | 14485 | |
| C | 13381 | |
| W | 11748 | 6.3% |
| M | 9512 | 5.1% |
| A | 9382 | 5.0% |
| T | 6499 | 3.5% |
| Other values (14) | 26132 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 51078 | |
| 5 | 48031 | |
| 1 | 41652 | |
| 2 | 31311 | |
| 3 | 25187 | |
| 6 | 23265 | |
| 7 | 20531 | |
| 4 | 17381 | 6.1% |
| 9 | 14549 | 5.1% |
| 8 | 14155 | 4.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 46 | |
| . | 8 | 14.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 185556 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 480 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 824090 | |
| Common | 473233 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 77071 | 9.4% |
| e | 71979 | 8.7% |
| a | 58783 | 7.1% |
| d | 55945 | 6.8% |
| n | 49722 | 6.0% |
| t | 47992 | 5.8% |
| i | 45039 | 5.5% |
| o | 36413 | 4.4% |
| l | 32505 | 3.9% |
| R | 31751 | 3.9% |
| Other values (39) | 316890 |
Common
| Value | Count | Frequency (%) |
| (space) | 185556 | |
| 0 | 51078 | 10.8% |
| 5 | 48031 | 10.1% |
| 1 | 41652 | 8.8% |
| 2 | 31311 | 6.6% |
| 3 | 25187 | 5.3% |
| 6 | 23265 | 4.9% |
| 7 | 20531 | 4.3% |
| 4 | 17381 | 3.7% |
| 9 | 14549 | 3.1% |
| Other values (5) | 14692 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1297323 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 185556 | 14.3% |
| r | 77071 | 5.9% |
| e | 71979 | 5.5% |
| a | 58783 | 4.5% |
| d | 55945 | 4.3% |
| 0 | 51078 | 3.9% |
| n | 49722 | 3.8% |
| 5 | 48031 | 3.7% |
| t | 47992 | 3.7% |
| i | 45039 | 3.5% |
| Other values (54) | 606127 |
StreetNo
Real number (ℝ)
| Distinct | 3090 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2946.1325 |
| Minimum | 1 |
|---|---|
| Maximum | 905629 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 57 |
| Q1 | 1050 |
| median | 2375 |
| Q3 | 5100 |
| 95-th percentile | 7070 |
| Maximum | 905629 |
| Range | 905628 |
| Interquartile range (IQR) | 4050 |
Descriptive statistics
| Standard deviation | 3997.6662 |
|---|---|
| Coefficient of variation (CV) | 1.35692 |
| Kurtosis | 33315.386 |
| Mean | 2946.1325 |
| Median Absolute Deviation (MAD) | 1655 |
| Skewness | 147.65244 |
| Sum | 2.2989261 × 10⁸ |
| Variance | 15981335 |
| Monotonicity | Not monotonic |
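A skewness this large is consistent with a few extreme values, such as the reported maximum of 905629 against a 95th percentile of 7070. The sketch below computes the population skewness g₁ = m₃ / m₂^1.5 on a handful of the quantiles above, with and without that maximum. It is only an illustration: the report's figure may come from a bias-corrected estimator, and the `skewness` helper is a hypothetical name.

```python
from statistics import fmean

def skewness(values: list[float]) -> float:
    """Population skewness g1 = m3 / m2**1.5 (no bias correction)."""
    mu = fmean(values)
    m2 = fmean((v - mu) ** 2 for v in values)   # second central moment
    m3 = fmean((v - mu) ** 3 for v in values)   # third central moment
    return m3 / m2 ** 1.5

typical = [57, 1050, 2375, 5100, 7070]   # quantiles from the table above
with_outlier = typical + [905629]        # add the reported maximum

print(skewness(typical), skewness(with_outlier))
```

Adding the single extreme value raises the skewness of this small sample sharply, which mirrors the effect seen in the full column.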
Common Values
| Value | Count | Frequency (%) |
| 100 | 1101 | 1.4% |
| 5100 | 601 | 0.8% |
| 7205 | 520 | 0.7% |
| 1250 | 448 | 0.6% |
| 1 | 442 | 0.6% |
| 2000 | 383 | 0.5% |
| 1550 | 359 | 0.5% |
| 50 | 313 | 0.4% |
| 4141 | 310 | 0.4% |
| 2425 | 304 | 0.4% |
| Other values (3080) | 73251 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 442 | |
| 2 | 198 | |
| 3 | 200 | |
| 4 | 154 | 0.2% |
| 5 | 7 | < 0.1% |
| 6 | 33 | < 0.1% |
| 7 | 25 | < 0.1% |
| 8 | 21 | < 0.1% |
| 9 | 20 | < 0.1% |
| 10 | 154 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 905629 | 1 | < 0.1% |
| 7895 | 138 | |
| 7890 | 7 | < 0.1% |
| 7885 | 79 | |
| 7880 | 6 | < 0.1% |
| 7875 | 30 | < 0.1% |
| 7860 | 5 | < 0.1% |
| 7855 | 5 | < 0.1% |
| 7850 | 4 | < 0.1% |
| 7840 | 1 | < 0.1% |
StreetName
Categorical
| Distinct | 669 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| Dundas St E | 3202 |
|---|---|
| Matheson Blvd E | 2125 |
| Dixie Rd | 1982 |
| Hurontario St | 1971 |
| Lakeshore Rd E | 1628 |
| Other values (664) |
Length
| Max length | 26 |
|---|---|
| Median length | 22 |
| Mean length | 11.945035 |
| Min length | 3 |
Characters and Unicode
| Total characters | 932095 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 57 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Ambassador Dr |
|---|---|
| 2nd row | Ambassador Dr |
| 3rd row | Ambassador Dr |
| 4th row | Ambassador Dr |
| 5th row | Ambassador Dr |
Common Values
| Value | Count | Frequency (%) |
| Dundas St E | 3202 | 4.1% |
| Matheson Blvd E | 2125 | 2.7% |
| Dixie Rd | 1982 | 2.5% |
| Hurontario St | 1971 | 2.5% |
| Lakeshore Rd E | 1628 | 2.1% |
| Dundas St W | 1586 | 2.0% |
| City Centre Dr | 1528 | 2.0% |
| Britannia Rd E | 1441 | 1.8% |
| Tomken Rd | 1416 | 1.8% |
| Argentia Rd | 1400 | 1.8% |
| Other values (659) | 59753 |
Words
| Value | Count | Frequency (%) |
| rd | 28598 | 15.4% |
| dr | 17906 | 9.7% |
| e | 12045 | 6.5% |
| st | 9954 | 5.4% |
| blvd | 8011 | 4.3% |
| w | 7247 | 3.9% |
| dundas | 4805 | 2.6% |
| ave | 3978 | 2.1% |
| matheson | 2625 | 1.4% |
| pky | 2575 | 1.4% |
| Other values (665) | 87802 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 107515 | 11.5% |
| r | 77031 | 8.3% |
| e | 71980 | 7.7% |
| a | 58785 | 6.3% |
| d | 55948 | 6.0% |
| n | 49725 | 5.3% |
| t | 47986 | 5.1% |
| i | 45031 | 4.8% |
| o | 36410 | 3.9% |
| l | 32503 | 3.5% |
| Other values (43) | 349181 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 636923 | |
| Uppercase Letter | 187126 | 20.1% |
| Space Separator | 107515 | 11.5% |
| Dash Punctuation | 480 | 0.1% |
| Other Punctuation | 51 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 77031 | |
| e | 71980 | |
| a | 58785 | |
| d | 55948 | |
| n | 49725 | 7.8% |
| t | 47986 | 7.5% |
| i | 45031 | 7.1% |
| o | 36410 | 5.7% |
| l | 32503 | 5.1% |
| s | 27702 | 4.3% |
| Other values (15) | 133822 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 31747 | |
| D | 29017 | |
| S | 18788 | |
| E | 16439 | |
| B | 14481 | |
| C | 13374 | |
| W | 11747 | 6.3% |
| M | 9514 | 5.1% |
| A | 9382 | 5.0% |
| T | 6500 | 3.5% |
| Other values (14) | 26137 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 45 | |
| . | 6 | 11.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 107515 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 480 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 824049 | |
| Common | 108046 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 77031 | 9.3% |
| e | 71980 | 8.7% |
| a | 58785 | 7.1% |
| d | 55948 | 6.8% |
| n | 49725 | 6.0% |
| t | 47986 | 5.8% |
| i | 45031 | 5.5% |
| o | 36410 | 4.4% |
| l | 32503 | 3.9% |
| R | 31747 | 3.9% |
| Other values (39) | 316903 |
Common
| Value | Count | Frequency (%) |
| (space) | 107515 | |
| - | 480 | 0.4% |
| ' | 45 | < 0.1% |
| . | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 932095 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 107515 | 11.5% |
| r | 77031 | 8.3% |
| e | 71980 | 7.7% |
| a | 58785 | 6.3% |
| d | 55948 | 6.0% |
| n | 49725 | 5.3% |
| t | 47986 | 5.1% |
| i | 45031 | 4.8% |
| o | 36410 | 3.9% |
| l | 32503 | 3.5% |
| Other values (43) | 349181 |
BldgNo
Categorical
| Distinct | 94 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| Bldg 2 | 897 |
|---|---|
| Bldg 1 | 858 |
| Bldg A | 426 |
| Bldg B | 348 |
| Other values (89) | 1705 |
Length
| Max length | 18 |
|---|---|
| Median length | 1 |
| Mean length | 1.2798339 |
| Min length | 1 |
Characters and Unicode
| Total characters | 99868 |
|---|---|
| Distinct characters | 53 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 24 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 73798 | |
| Bldg 2 | 897 | 1.1% |
| Bldg 1 | 858 | 1.1% |
| Bldg A | 426 | 0.5% |
| Bldg B | 348 | 0.4% |
| Bldg 3 | 292 | 0.4% |
| Bldg 4 | 221 | 0.3% |
| Bldg K | 135 | 0.2% |
| Bldg C | 97 | 0.1% |
| East Tower | 67 | 0.1% |
| Other values (84) | 893 | 1.1% |
Words
| Value | Count | Frequency (%) |
| bldg | 3720 | |
| 1 | 943 | 11.3% |
| 2 | 941 | 11.2% |
| a | 448 | 5.3% |
| b | 372 | 4.4% |
| 3 | 321 | 3.8% |
| 4 | 276 | 3.3% |
| plaza | 169 | 2.0% |
| k | 135 | 1.6% |
| tower | 118 | 1.4% |
| Other values (58) | 931 | 11.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 77939 | |
| B | 4161 | 4.2% |
| l | 3969 | 4.0% |
| g | 3806 | 3.8% |
| d | 3752 | 3.8% |
| 1 | 1103 | 1.1% |
| 2 | 1002 | 1.0% |
| a | 514 | 0.5% |
| A | 454 | 0.5% |
| 3 | 326 | 0.3% |
| Other values (43) | 2842 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 77939 | |
| Lowercase Letter | 13394 | 13.4% |
| Uppercase Letter | 5595 | 5.6% |
| Decimal Number | 2933 | 2.9% |
| Other Punctuation | 5 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 4161 | |
| A | 454 | 8.1% |
| P | 170 | 3.0% |
| K | 135 | 2.4% |
| E | 119 | 2.1% |
| T | 115 | 2.1% |
| C | 106 | 1.9% |
| H | 83 | 1.5% |
| D | 57 | 1.0% |
| W | 51 | 0.9% |
| Other values (10) | 144 | 2.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 3969 | |
| g | 3806 | |
| d | 3752 | |
| a | 514 | 3.8% |
| e | 269 | 2.0% |
| r | 225 | 1.7% |
| z | 169 | 1.3% |
| o | 151 | 1.1% |
| t | 149 | 1.1% |
| s | 121 | 0.9% |
| Other values (10) | 269 | 2.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1103 | |
| 2 | 1002 | |
| 3 | 326 | 11.1% |
| 4 | 279 | 9.5% |
| 9 | 45 | 1.5% |
| 6 | 43 | 1.5% |
| 5 | 40 | 1.4% |
| 7 | 39 | 1.3% |
| 0 | 33 | 1.1% |
| 8 | 23 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 77939 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 80879 | |
| Latin | 18989 | 19.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 4161 | |
| l | 3969 | |
| g | 3806 | |
| d | 3752 | |
| a | 514 | 2.7% |
| A | 454 | 2.4% |
| e | 269 | 1.4% |
| r | 225 | 1.2% |
| P | 170 | 0.9% |
| z | 169 | 0.9% |
| Other values (30) | 1500 | 7.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 77939 | |
| 1 | 1103 | 1.4% |
| 2 | 1002 | 1.2% |
| 3 | 326 | 0.4% |
| 4 | 279 | 0.3% |
| 9 | 45 | 0.1% |
| 6 | 43 | 0.1% |
| 5 | 40 | < 0.1% |
| 7 | 39 | < 0.1% |
| 0 | 33 | < 0.1% |
| Other values (3) | 30 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 99868 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 77939 | |
| B | 4161 | 4.2% |
| l | 3969 | 4.0% |
| g | 3806 | 3.8% |
| d | 3752 | 3.8% |
| 1 | 1103 | 1.1% |
| 2 | 1002 | 1.0% |
| a | 514 | 0.5% |
| A | 454 | 0.5% |
| 3 | 326 | 0.3% |
| Other values (43) | 2842 | 2.8% |
UnitNo
Categorical
| Distinct | 3322 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 1 | 2763 |
|---|---|
| 2 | 2226 |
| 3 | 1941 |
| 4 | 1823 |
| Other values (3317) |
Length
| Max length | 39 |
|---|---|
| Median length | 1 |
| Mean length | 2.2311488 |
| Min length | 1 |
Characters and Unicode
| Total characters | 174101 |
|---|---|
| Distinct characters | 69 |
| Distinct categories | 10 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 1140 |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 24367 | |
| 1 | 2763 | 3.5% |
| 2 | 2226 | 2.9% |
| 3 | 1941 | 2.5% |
| 4 | 1823 | 2.3% |
| 5 | 1597 | 2.0% |
| 6 | 1483 | 1.9% |
| 7 | 1286 | 1.6% |
| 8 | 1182 | 1.5% |
| 9 | 993 | 1.3% |
| Other values (3312) | 38371 |
Words
| Value | Count | Frequency (%) |
| 1 | 3419 | 5.4% |
| to | 2757 | 4.4% |
| 2 | 2631 | 4.2% |
| 3 | 2352 | 3.7% |
| 4 | 2199 | 3.5% |
| 5 | 2002 | 3.2% |
| 6 | 1816 | 2.9% |
| 7 | 1704 | 2.7% |
| 8 | 1577 | 2.5% |
| 9 | 1315 | 2.1% |
| Other values (2160) | 41469 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 34098 | |
| 1 | 28398 | |
| 2 | 18376 | |
| 0 | 18283 | |
| 3 | 10149 | 5.8% |
| 4 | 8314 | 4.8% |
| 5 | 7050 | 4.0% |
| 6 | 5941 | 3.4% |
| 7 | 5008 | 2.9% |
| 8 | 4658 | 2.7% |
| Other values (59) | 33826 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 109874 | |
| Space Separator | 34098 | 19.6% |
| Lowercase Letter | 12491 | 7.2% |
| Uppercase Letter | 10697 | 6.1% |
| Other Punctuation | 4969 | 2.9% |
| Dash Punctuation | 1812 | 1.0% |
| Close Punctuation | 70 | < 0.1% |
| Open Punctuation | 70 | < 0.1% |
| Math Symbol | 15 | < 0.1% |
| Control | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2952 | |
| B | 2399 | |
| C | 992 | 9.3% |
| F | 814 | 7.6% |
| D | 563 | 5.3% |
| E | 562 | 5.3% |
| H | 362 | 3.4% |
| L | 333 | 3.1% |
| G | 324 | 3.0% |
| J | 189 | 1.8% |
| Other values (15) | 1207 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3780 | |
| t | 3301 | |
| r | 843 | 6.7% |
| l | 756 | 6.1% |
| e | 737 | 5.9% |
| n | 464 | 3.7% |
| a | 444 | 3.6% |
| s | 410 | 3.3% |
| p | 278 | 2.2% |
| d | 261 | 2.1% |
| Other values (13) | 1217 | 9.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 28398 | |
| 2 | 18376 | |
| 0 | 18283 | |
| 3 | 10149 | 9.2% |
| 4 | 8314 | 7.6% |
| 5 | 7050 | 6.4% |
| 6 | 5941 | 5.4% |
| 7 | 5008 | 4.6% |
| 8 | 4658 | 4.2% |
| 9 | 3697 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 3862 | |
| , | 1058 | 21.3% |
| . | 28 | 0.6% |
| / | 20 | 0.4% |
| … | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 34098 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1812 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 70 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 70 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 15 |
Control
| Value | Count | Frequency (%) |
| 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 150913 | |
| Latin | 23188 | 13.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3780 | |
| t | 3301 | |
| A | 2952 | |
| B | 2399 | 10.3% |
| C | 992 | 4.3% |
| r | 843 | 3.6% |
| F | 814 | 3.5% |
| l | 756 | 3.3% |
| e | 737 | 3.2% |
| D | 563 | 2.4% |
| Other values (38) | 6051 |
Common
| Value | Count | Frequency (%) |
| 34098 | ||
| 1 | 28398 | |
| 2 | 18376 | |
| 0 | 18283 | |
| 3 | 10149 | 6.7% |
| 4 | 8314 | 5.5% |
| 5 | 7050 | 4.7% |
| 6 | 5941 | 3.9% |
| 7 | 5008 | 3.3% |
| 8 | 4658 | 3.1% |
| Other values (11) | 10638 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 174100 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 34098 | 19.6% |
| 1 | 28398 | |
| 2 | 18376 | |
| 0 | 18283 | |
| 3 | 10149 | 5.8% |
| 4 | 8314 | 4.8% |
| 5 | 7050 | 4.0% |
| 6 | 5941 | 3.4% |
| 7 | 5008 | 2.9% |
| 8 | 4658 | 2.7% |
| Other values (58) | 33825 |
Punctuation
| Value | Count | Frequency (%) |
| … | 1 |
PostalCode
Categorical
| Distinct | 2901 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| L5B 2C9 | 768 |
|---|---|
| L5M 4Z5 | 523 |
| L4T 2T9 | 477 |
| L5E 1V4 | 394 |
| L5P 1B2 | 386 |
| Other values (2896) |
Length
| Max length | 33 |
|---|---|
| Median length | 7 |
| Mean length | 6.995425 |
| Min length | 1 |
Characters and Unicode
| Total characters | 545867 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 138 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | L5T 2J3 |
|---|---|
| 2nd row | L5T 2J3 |
| 3rd row | L5T 2J3 |
| 4th row | L5T 2J3 |
| 5th row | L5T 2J3 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| L5B 2C9 | 768 | 1.0% |
| L5M 4Z5 | 523 | 0.7% |
| L4T 2T9 | 477 | 0.6% |
| L5E 1V4 | 394 | 0.5% |
| L5P 1B2 | 386 | 0.5% |
| L5C 1V8 | 332 | 0.4% |
| L5J 1K5 | 296 | 0.4% |
| L4W 5G6 | 284 | 0.4% |
| L4X 1L4 | 249 | 0.3% |
| L5B 1M7 | 247 | 0.3% |
| Other values (2891) | 74076 | 94.9% |
Words
| Value | Count | Frequency (%) |
| l4w | 12403 | 8.0% |
| l5t | 8317 | 5.3% |
| l5n | 6069 | 3.9% |
| l4z | 4948 | 3.2% |
| l5l | 4693 | 3.0% |
| l5b | 4588 | 2.9% |
| l5s | 4258 | 2.7% |
| l5m | 3801 | 2.4% |
| l4t | 3311 | 2.1% |
| l5a | 3290 | 2.1% |
| Other values (1077) | 100200 |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 86506 | |
| (space) | 77968 | 14.3% |
| 5 | 63752 | |
| 4 | 47370 | 8.7% |
| 1 | 39205 | 7.2% |
| 2 | 25913 | 4.7% |
| 3 | 16425 | 3.0% |
| W | 16127 | 3.0% |
| T | 14622 | 2.7% |
| 6 | 11449 | 2.1% |
| Other values (37) | 146530 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 233941 | |
| Decimal Number | 233912 | |
| Space Separator | 77968 | 14.3% |
| Lowercase Letter | 32 | < 0.1% |
| Control | 14 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 86506 | |
| W | 16127 | 6.9% |
| T | 14622 | 6.3% |
| N | 9608 | 4.1% |
| A | 9326 | 4.0% |
| B | 8748 | 3.7% |
| Z | 8458 | 3.6% |
| M | 7909 | 3.4% |
| C | 7879 | 3.4% |
| V | 7750 | 3.3% |
| Other values (12) | 57008 |
Lowercase Letter
| Value | Count | Frequency (%) |
| k | 9 | |
| c | 5 | |
| l | 5 | |
| s | 3 | 9.4% |
| d | 2 | 6.2% |
| t | 2 | 6.2% |
| h | 1 | 3.1% |
| i | 1 | 3.1% |
| a | 1 | 3.1% |
| g | 1 | 3.1% |
| Other values (2) | 2 | 6.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 63752 | |
| 4 | 47370 | |
| 1 | 39205 | |
| 2 | 25913 | |
| 3 | 16425 | 7.0% |
| 6 | 11449 | 4.9% |
| 8 | 9658 | 4.1% |
| 9 | 8878 | 3.8% |
| 7 | 8525 | 3.6% |
| 0 | 2737 | 1.2% |
Control
| Value | Count | Frequency (%) |
| 8 | ||
| 6 |
Space Separator
| Value | Count | Frequency (%) |
| 77968 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 311894 | |
| Latin | 233973 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 86506 | |
| W | 16127 | 6.9% |
| T | 14622 | 6.2% |
| N | 9608 | 4.1% |
| A | 9326 | 4.0% |
| B | 8748 | 3.7% |
| Z | 8458 | 3.6% |
| M | 7909 | 3.4% |
| C | 7879 | 3.4% |
| V | 7750 | 3.3% |
| Other values (24) | 57040 |
Common
| Value | Count | Frequency (%) |
| 77968 | ||
| 5 | 63752 | |
| 4 | 47370 | |
| 1 | 39205 | |
| 2 | 25913 | 8.3% |
| 3 | 16425 | 5.3% |
| 6 | 11449 | 3.7% |
| 8 | 9658 | 3.1% |
| 9 | 8878 | 2.8% |
| 7 | 8525 | 2.7% |
| Other values (3) | 2751 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 545867 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| L | 86506 | |
| (space) | 77968 | 14.3% |
| 5 | 63752 | |
| 4 | 47370 | 8.7% |
| 1 | 39205 | 7.2% |
| 2 | 25913 | 4.7% |
| 3 | 16425 | 3.0% |
| W | 16127 | 3.0% |
| T | 14622 | 2.7% |
| 6 | 11449 | 2.1% |
| Other values (37) | 146530 |
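The PostalCode figures above (minimum length 1, maximum length 33, 32 lowercase letters and 14 control characters) suggest that a small number of entries do not follow the `L5T 2J3` shape seen in the sample rows. The sketch below is one possible way to sort values into buckets before cleaning. The function name `classify_postal_code` and the four bucket labels are hypothetical, and the pattern checks only the letter/digit shape, not whether a code actually exists.

```python
import re

# Letter-digit-letter, space, digit-letter-digit, as in the sample "L5T 2J3".
# Assumption: upper-case letters and a single ASCII space are the target form.
POSTAL_RE = re.compile(r"[A-Z][0-9][A-Z] [0-9][A-Z][0-9]")


def classify_postal_code(value: str) -> str:
    """Return 'blank', 'valid', 'fixable' or 'invalid' for one raw entry."""
    if value.strip() == "":
        return "blank"                      # whitespace-only placeholder
    if POSTAL_RE.fullmatch(value):
        return "valid"                      # already in the target form
    if POSTAL_RE.fullmatch(value.strip().upper()):
        return "fixable"                    # only case or padding is off
    return "invalid"                        # needs manual review


if __name__ == "__main__":
    for raw in ["L5T 2J3", " l5t 2j3\n", " ", "see note"]:
        print(repr(raw), "->", classify_postal_code(raw))
```

Entries classified as `fixable` can be normalised with `value.strip().upper()`; the remaining `invalid` entries would need to be inspected by hand.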
Location
Categorical
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 47693 |
| Missing (%) | 61.1% |
| Memory size | 609.8 KiB |
| Northeast EA (West) | |
|---|---|
| Gateway EA (East) | |
| Dixie EA | |
| Meadowvale Business Park CC | |
| Western Business Park EA | |
| Other values (51) |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 16.483866 |
| Min length | 7 |
Characters and Unicode
| Total characters | 500104 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Gateway EA (East) |
|---|---|
| 2nd row | Gateway EA (East) |
| 3rd row | Gateway EA (East) |
| 4th row | Gateway EA (East) |
| 5th row | Gateway EA (East) |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Northeast EA (West) | 8087 | 10.4% |
| Gateway EA (East) | 1828 | 2.3% |
| Dixie EA | 1814 | 2.3% |
| Meadowvale Business Park CC | 1734 | 2.2% |
| Western Business Park EA | 1580 | 2.0% |
| DT Core | 1256 | 1.6% |
| DT Cooksville | 931 | 1.2% |
| Airport CC | 906 | 1.2% |
| Northeast EA (East) | 738 | 0.9% |
| Mavis-Erindale EA | 719 | 0.9% |
| Other values (46) | 10746 | 13.8% |
| (Missing) | 47693 | 61.1% |
Words
| Value | Count | Frequency (%) |
| ea | 15721 | |
| northeast | 8825 | 10.5% |
| west | 8730 | 10.4% |
| nhd | 5805 | 6.9% |
| park | 3715 | 4.4% |
| east | 3604 | 4.3% |
| business | 3314 | 3.9% |
| cc | 3101 | 3.7% |
| gateway | 2618 | 3.1% |
| dt | 2576 | 3.1% |
| Other values (45) | 25930 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 53600 | 10.7% |
| e | 44801 | 9.0% |
| t | 42033 | 8.4% |
| s | 38109 | 7.6% |
| a | 32858 | 6.6% |
| r | 25884 | 5.2% |
| o | 23256 | 4.7% |
| E | 21305 | 4.3% |
| i | 18674 | 3.7% |
| A | 17879 | 3.6% |
| Other values (33) | 181705 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 300748 | |
| Uppercase Letter | 120982 | |
| Space Separator | 53600 | 10.7% |
| Open Punctuation | 11741 | 2.3% |
| Close Punctuation | 11741 | 2.3% |
| Dash Punctuation | 1292 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 44801 | |
| t | 42033 | |
| s | 38109 | |
| a | 32858 | |
| r | 25884 | |
| o | 23256 | |
| i | 18674 | |
| l | 13559 | 4.5% |
| n | 10785 | 3.6% |
| h | 10586 | 3.5% |
| Other values (11) | 40203 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 21305 | |
| A | 17879 | |
| N | 17487 | |
| C | 14057 | |
| W | 10310 | |
| D | 10195 | |
| H | 6417 | 5.3% |
| M | 5583 | 4.6% |
| P | 4710 | 3.9% |
| B | 3314 | 2.7% |
| Other values (8) | 9725 |
Space Separator
| Value | Count | Frequency (%) |
| 53600 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 11741 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 11741 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1292 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 421730 | |
| Common | 78374 | 15.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 44801 | 10.6% |
| t | 42033 | 10.0% |
| s | 38109 | 9.0% |
| a | 32858 | 7.8% |
| r | 25884 | 6.1% |
| o | 23256 | 5.5% |
| E | 21305 | 5.1% |
| i | 18674 | 4.4% |
| A | 17879 | 4.2% |
| N | 17487 | 4.1% |
| Other values (29) | 139444 |
Common
| Value | Count | Frequency (%) |
| 53600 | ||
| ( | 11741 | 15.0% |
| ) | 11741 | 15.0% |
| - | 1292 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 500104 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 53600 | 10.7% |
| e | 44801 | 9.0% |
| t | 42033 | 8.4% |
| s | 38109 | 7.6% |
| a | 32858 | 6.6% |
| r | 25884 | 5.2% |
| o | 23256 | 4.7% |
| E | 21305 | 4.3% |
| i | 18674 | 3.7% |
| A | 17879 | 3.6% |
| Other values (33) | 181705 |
Ward
Real number (ℝ)
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.3913395 |
| Minimum | 1 |
|---|---|
| Maximum | 11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 11 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.4758594 |
|---|---|
| Coefficient of variation (CV) | 0.459229 |
| Kurtosis | 0.01057504 |
| Mean | 5.3913395 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.34308626 |
| Sum | 420697 |
| Variance | 6.12988 |
| Monotonicity | Not monotonic |
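Several of the Ward figures above are derived from the others, so they can be cross-checked with a few lines of arithmetic. The sketch below recomputes the sum, variance, coefficient of variation, interquartile range and range from the reported mean, standard deviation, quartiles and extremes, using the 78032 observations listed in the dataset statistics. The helper name `derived_stats` is hypothetical, and the inputs are the rounded values printed in the report, so results agree only up to rounding.

```python
# Values copied from the Ward tables above (rounded as printed in the report).
n_obs = 78032              # number of observations, no missing values for Ward
mean = 5.3913395
std = 2.4758594
q1, q3 = 5, 7
minimum, maximum = 1, 11


def derived_stats(n, mean, std, q1, q3, lo, hi):
    """Recompute the derived statistics from the primary ones."""
    return {
        "sum": mean * n,          # mean = sum / n
        "variance": std ** 2,     # variance is the squared standard deviation
        "cv": std / mean,         # coefficient of variation
        "iqr": q3 - q1,           # interquartile range
        "range": hi - lo,
    }


stats = derived_stats(n_obs, mean, std, q1, q3, minimum, maximum)
for name, value in stats.items():
    print(f"{name}: {value:.6g}")
```

The same division explains the frequency column in the value tables: a count divided by the number of observations, for example 6772 / 78032 for ward 1, gives the 8.7% shown.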
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 33956 | 43.5% |
| 1 | 6772 | 8.7% |
| 8 | 6086 | 7.8% |
| 7 | 5561 | 7.1% |
| 3 | 5005 | 6.4% |
| 9 | 4687 | 6.0% |
| 11 | 4300 | 5.5% |
| 4 | 4163 | 5.3% |
| 6 | 3584 | 4.6% |
| 2 | 3163 | 4.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 6772 | 8.7% |
| 2 | 3163 | 4.1% |
| 3 | 5005 | 6.4% |
| 4 | 4163 | 5.3% |
| 5 | 33956 | 43.5% |
| 6 | 3584 | 4.6% |
| 7 | 5561 | 7.1% |
| 8 | 6086 | 7.8% |
| 9 | 4687 | 6.0% |
| 10 | 755 | 1.0% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 11 | 4300 | 5.5% |
| 10 | 755 | 1.0% |
| 9 | 4687 | 6.0% |
| 8 | 6086 | 7.8% |
| 7 | 5561 | 7.1% |
| 6 | 3584 | 4.6% |
| 5 | 33956 | 43.5% |
| 4 | 4163 | 5.3% |
| 3 | 5005 | 6.4% |
| 2 | 3163 | 4.1% |
NAICSCode
Real number (ℝ)
| Distinct | 715 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 532884.63 |
| Minimum | 23829 |
|---|---|
| Maximum | 913910 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 23829 |
|---|---|
| 5-th percentile | 315239 |
| Q1 | 417930 |
| median | 524210 |
| Q3 | 621330 |
| 95-th percentile | 812116 |
| Maximum | 913910 |
| Range | 890081 |
| Interquartile range (IQR) | 203400 |
Descriptive statistics
| Standard deviation | 158671.14 |
|---|---|
| Coefficient of variation (CV) | 0.29775891 |
| Kurtosis | -0.65947378 |
| Mean | 532884.63 |
| Median Absolute Deviation (MAD) | 97300 |
| Skewness | 0.31162396 |
| Sum | 4.1582053 × 10^10 |
| Variance | 2.5176532 × 10^10 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 722512 | 3657 | 4.7% |
| 811111 | 1997 | 2.6% |
| 722511 | 1782 | 2.3% |
| 621210 | 1610 | 2.1% |
| 621110 | 1513 | 1.9% |
| 541110 | 1384 | 1.8% |
| 812115 | 1308 | 1.7% |
| 488519 | 1264 | 1.6% |
| 611110 | 1242 | 1.6% |
| 813110 | 1101 | 1.4% |
| Other values (705) | 61174 | 78.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 23829 | 1 | < 0.1% |
| 44612 | 3 | < 0.1% |
| 44812 | 1 | < 0.1% |
| 54111 | 4 | < 0.1% |
| 111999 | 1 | < 0.1% |
| 112999 | 3 | < 0.1% |
| 115110 | 2 | < 0.1% |
| 212299 | 6 | < 0.1% |
| 213118 | 3 | < 0.1% |
| 213119 | 6 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 913910 | 103 | 0.1% |
| 913140 | 101 | 0.1% |
| 913130 | 6 | < 0.1% |
| 912910 | 37 | < 0.1% |
| 912210 | 27 | < 0.1% |
| 912190 | 12 | < 0.1% |
| 912150 | 3 | < 0.1% |
| 912130 | 5 | < 0.1% |
| 912120 | 3 | < 0.1% |
| 912110 | 1 | < 0.1% |
NAICSCat
Categorical
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| Manufacturing | |
|---|---|
| Other Services | |
| Retail | |
| Wholesale | |
| Professional | |
| Other values (28) |
Length
| Max length | 50 |
|---|---|
| Median length | 39 |
| Mean length | 13.436295 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1048461 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Wholesale |
|---|---|
| 2nd row | Manufacturing |
| 3rd row | Manufacturing |
| 4th row | Manufacturing |
| 5th row | Wholesale |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Manufacturing | 9682 | 12.4% |
| Other Services | 9053 | 11.6% |
| Retail | 8775 | 11.2% |
| Wholesale | 6955 | 8.9% |
| Professional | 5672 | 7.3% |
| Health Care | 5141 | 6.6% |
| Accommodation | 4936 | 6.3% |
| Transportation | 3046 | 3.9% |
| Construction | 2783 | 3.6% |
| Educational | 2438 | 3.1% |
| Other values (23) | 19551 | 25.1% |
Words
| Value | Count | Frequency (%) |
| services | 12307 | 10.0% |
| retail | 11071 | 9.0% |
| manufacturing | 9682 | 7.9% |
| other | 9053 | 7.4% |
| wholesale | 8749 | 7.1% |
| and | 7484 | 6.1% |
| professional | 7102 | 5.8% |
| health | 6459 | 5.3% |
| care | 6459 | 5.3% |
| accommodation | 6148 | 5.0% |
| Other values (36) | 37992 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 109575 | 10.5% |
| e | 105760 | 10.1% |
| i | 79617 | 7.6% |
| n | 77107 | 7.4% |
| t | 76590 | 7.3% |
| r | 66562 | 6.3% |
| o | 64715 | 6.2% |
| s | 54566 | 5.2% |
| c | 52730 | 5.0% |
| l | 50880 | 4.9% |
| Other values (27) | 310359 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 886161 | |
| Uppercase Letter | 115659 | 11.0% |
| Space Separator | 44474 | 4.2% |
| Other Punctuation | 2167 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 109575 | |
| e | 105760 | |
| i | 79617 | |
| n | 77107 | |
| t | 76590 | |
| r | 66562 | |
| o | 64715 | |
| s | 54566 | 6.2% |
| c | 52730 | 6.0% |
| l | 50880 | 5.7% |
| Other values (10) | 148059 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 15580 | |
| R | 14001 | |
| A | 12313 | |
| M | 10711 | |
| W | 10015 | |
| C | 9462 | |
| T | 9307 | |
| O | 9053 | |
| P | 7583 | |
| H | 6459 | |
| Other values (5) | 11175 |
Space Separator
| Value | Count | Frequency (%) |
| 44474 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2167 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1001820 | |
| Common | 46641 | 4.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 109575 | |
| e | 105760 | |
| i | 79617 | 7.9% |
| n | 77107 | 7.7% |
| t | 76590 | 7.6% |
| r | 66562 | 6.6% |
| o | 64715 | 6.5% |
| s | 54566 | 5.4% |
| c | 52730 | 5.3% |
| l | 50880 | 5.1% |
| Other values (25) | 263718 |
Common
| Value | Count | Frequency (%) |
| 44474 | ||
| , | 2167 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1048461 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 109575 | 10.5% |
| e | 105760 | 10.1% |
| i | 79617 | 7.6% |
| n | 77107 | 7.4% |
| t | 76590 | 7.3% |
| r | 66562 | 6.3% |
| o | 64715 | 6.2% |
| s | 54566 | 5.2% |
| c | 52730 | 5.0% |
| l | 50880 | 4.9% |
| Other values (27) | 310359 |
NAICSDescr
Categorical
| Distinct | 1039 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| Limited-service eating places | 3647 |
|---|---|
| General Automotive Repair | 1992 |
| Full-service restaurants | 1777 |
| Offices of Dentists | 1603 |
| Offices of Physicians | 1504 |
| Other values (1034) |
Length
| Max length | 175 |
|---|---|
| Median length | 80 |
| Mean length | 35.436385 |
| Min length | 6 |
Characters and Unicode
| Total characters | 2765172 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 124 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Amusement and Sporting Goods Wholesaler-Distributors |
|---|---|
| 2nd row | Support Activities for Printing |
| 3rd row | Support Activities for Printing |
| 4th row | Other Printing |
| 5th row | Industrial Machinery, Equipment and Supplies Wholesaler-Distributors |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Limited-service eating places | 3647 | 4.7% |
| General Automotive Repair | 1992 | 2.6% |
| Full-service restaurants | 1777 | 2.3% |
| Offices of Dentists | 1603 | 2.1% |
| Offices of Physicians | 1504 | 1.9% |
| Offices of Lawyers | 1376 | 1.8% |
| Beauty Salons | 1302 | 1.7% |
| Other Freight Transportation Arrangement | 1255 | 1.6% |
| Elementary and Secondary Schools | 1240 | 1.6% |
| Religious Organizations | 1098 | 1.4% |
| Other values (1029) | 61238 | 78.5% |
Words
| Value | Count | Frequency (%) |
| and | 33347 | 10.0% |
| other | 18681 | 5.6% |
| stores | 9245 | 2.8% |
| offices | 8694 | 2.6% |
| of | 8405 | 2.5% |
| services | 8315 | 2.5% |
| all | 8273 | 2.5% |
| wholesaler-distributors | 7178 | 2.1% |
| manufacturing | 6730 | 2.0% |
| supplies | 4486 | 1.3% |
| Other values (1054) | 221747 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 278627 | 10.1% |
| (space) | 258164 | 9.3% |
| i | 198022 | 7.2% |
| r | 189307 | 6.8% |
| n | 183101 | 6.6% |
| t | 181749 | 6.6% |
| a | 181007 | 6.5% |
| s | 160174 | 5.8% |
| o | 139412 | 5.0% |
| l | 115516 | 4.2% |
| Other values (51) | 880093 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2193494 | |
| Uppercase Letter | 276079 | 10.0% |
| Space Separator | 258605 | 9.4% |
| Dash Punctuation | 17709 | 0.6% |
| Other Punctuation | 11390 | 0.4% |
| Open Punctuation | 4149 | 0.2% |
| Close Punctuation | 3340 | 0.1% |
| Control | 406 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 278627 | |
| i | 198022 | |
| r | 189307 | |
| n | 183101 | 8.3% |
| t | 181749 | 8.3% |
| a | 181007 | 8.3% |
| s | 160174 | 7.3% |
| o | 139412 | 6.4% |
| l | 115516 | 5.3% |
| c | 105666 | 4.8% |
| Other values (16) | 460913 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 38648 | |
| O | 30856 | |
| A | 24817 | 9.0% |
| C | 24436 | 8.9% |
| M | 21775 | 7.9% |
| P | 18986 | 6.9% |
| D | 14648 | 5.3% |
| W | 12588 | 4.6% |
| E | 11736 | 4.3% |
| F | 11266 | 4.1% |
| Other values (15) | 66323 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 9665 | |
| ' | 803 | 7.1% |
| & | 488 | 4.3% |
| . | 434 | 3.8% |
Space Separator
| Value | Count | Frequency (%) |
| 258164 | ||
| 441 | 0.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17709 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4149 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3340 |
Control
| Value | Count | Frequency (%) |
| 406 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2469573 | |
| Common | 295599 | 10.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 278627 | 11.3% |
| i | 198022 | 8.0% |
| r | 189307 | 7.7% |
| n | 183101 | 7.4% |
| t | 181749 | 7.4% |
| a | 181007 | 7.3% |
| s | 160174 | 6.5% |
| o | 139412 | 5.6% |
| l | 115516 | 4.7% |
| c | 105666 | 4.3% |
| Other values (41) | 736992 |
Common
| Value | Count | Frequency (%) |
| 258164 | ||
| - | 17709 | 6.0% |
| , | 9665 | 3.3% |
| ( | 4149 | 1.4% |
| ) | 3340 | 1.1% |
| ' | 803 | 0.3% |
| & | 488 | 0.2% |
| 441 | 0.1% | |
| . | 434 | 0.1% |
| 406 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2764731 | |
| None | 441 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 278627 | 10.1% |
| (space) | 258164 | 9.3% |
| i | 198022 | 7.2% |
| r | 189307 | 6.8% |
| n | 183101 | 6.6% |
| t | 181749 | 6.6% |
| a | 181007 | 6.5% |
| s | 160174 | 5.8% |
| o | 139412 | 5.0% |
| l | 115516 | 4.2% |
| Other values (50) | 879652 |
None
| Value | Count | Frequency (%) |
| 441 |
Phone
Categorical
| Distinct | 25064 |
|---|---|
| Distinct (%) | 32.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 1457 | |
| 905-615-3200 | 40 |
| 905-624-3811 | 35 |
| 000-000-0000 | 35 |
| 905-615-3777 | 24 |
| Other values (25059) |
Length
| Max length | 20 |
|---|---|
| Median length | 12 |
| Mean length | 11.66665 |
| Min length | 1 |
Characters and Unicode
| Total characters | 910372 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 7404 |
|---|---|
| Unique (%) | 9.5% |
Sample
| 1st row | 905-795-8900 |
|---|---|
| 2nd row | 905-795-9575 |
| 3rd row | 905-795-9519 |
| 4th row | 905-564-8121 |
| 5th row | 905-564-8080 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 1457 | 1.9% |
| 905-615-3200 | 40 | 0.1% |
| 905-624-3811 | 35 | < 0.1% |
| 000-000-0000 | 35 | < 0.1% |
| 905-615-3777 | 24 | < 0.1% |
| 905-677-9354 | 21 | < 0.1% |
| 905-670-4070 | 20 | < 0.1% |
| 905-615-4640 | 20 | < 0.1% |
| 905-615-4750 | 20 | < 0.1% |
| 905-615-4653 | 18 | < 0.1% |
| Other values (25054) | 76342 | 97.8% |
Words
| Value | Count | Frequency (%) |
| 905-615-3200 | 40 | 0.1% |
| 000-000-0000 | 35 | < 0.1% |
| 905-624-3811 | 35 | < 0.1% |
| 905-615-3777 | 24 | < 0.1% |
| 905-677-9354 | 21 | < 0.1% |
| 905-670-4070 | 20 | < 0.1% |
| 905-615-4640 | 20 | < 0.1% |
| 905-615-4750 | 20 | < 0.1% |
| 905-615-4653 | 18 | < 0.1% |
| 905-949-2222 | 17 | < 0.1% |
| Other values (25058) | 76339 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 143126 | |
| 0 | 136708 | |
| 5 | 117584 | |
| 9 | 114775 | |
| 2 | 71077 | |
| 6 | 70911 | |
| 7 | 60427 | |
| 8 | 60294 | |
| 1 | 49065 | 5.4% |
| 4 | 46596 | 5.1% |
| Other values (11) | 39809 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 765753 | |
| Dash Punctuation | 143130 | 15.7% |
| Space Separator | 1471 | 0.2% |
| Other Punctuation | 9 | < 0.1% |
| Lowercase Letter | 7 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 136708 | |
| 5 | 117584 | |
| 9 | 114775 | |
| 2 | 71077 | |
| 6 | 70911 | |
| 7 | 60427 | |
| 8 | 60294 | |
| 1 | 49065 | 6.4% |
| 4 | 46596 | 6.1% |
| 3 | 38316 | 5.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2 | |
| x | 2 | |
| t | 2 | |
| e | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 143126 | |
| – | 4 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6 | |
| ; | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 | |
| B | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1471 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 910363 | |
| Latin | 9 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 143126 | |
| 0 | 136708 | |
| 5 | 117584 | |
| 9 | 114775 | |
| 2 | 71077 | |
| 6 | 70911 | |
| 7 | 60427 | |
| 8 | 60294 | |
| 1 | 49065 | 5.4% |
| 4 | 46596 | 5.1% |
| Other values (5) | 39800 | 4.4% |
Latin
| Value | Count | Frequency (%) |
| o | 2 | |
| x | 2 | |
| t | 2 | |
| E | 1 | |
| e | 1 | |
| B | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 910368 | |
| Punctuation | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 143126 | |
| 0 | 136708 | |
| 5 | 117584 | |
| 9 | 114775 | |
| 2 | 71077 | |
| 6 | 70911 | |
| 7 | 60427 | |
| 8 | 60294 | |
| 1 | 49065 | 5.4% |
| 4 | 46596 | 5.1% |
| Other values (10) | 39805 | 4.4% |
Punctuation
| Value | Count | Frequency (%) |
| – | 4 |
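The Phone column reports 0 missing values, yet its most common entry is a blank string (1457 rows) and `000-000-0000` appears 35 times, so the true share of unusable numbers is higher than the "Missing" row suggests. The sketch below shows one way such placeholders could be mapped to a missing marker before further analysis. The names `clean_phone`, `is_standard` and `PLACEHOLDERS` are hypothetical, and treating `000-000-0000` as a placeholder is an assumption based on its appearance in the table above.

```python
import re
from typing import Optional

# Target shape taken from the sample rows, e.g. "905-795-8900".
PHONE_RE = re.compile(r"[0-9]{3}-[0-9]{3}-[0-9]{4}")

# Assumption: this all-zero value is a data-entry placeholder, not a number.
PLACEHOLDERS = {"000-000-0000"}


def clean_phone(value: str) -> Optional[str]:
    """Return the trimmed entry, or None when it carries no phone number."""
    # The report lists 4 dashes outside ASCII; map the en dash to a hyphen.
    text = value.strip().replace("\u2013", "-")
    if text == "" or text in PLACEHOLDERS:
        return None
    return text


def is_standard(text: str) -> bool:
    """True when the entry is exactly NNN-NNN-NNNN."""
    return PHONE_RE.fullmatch(text) is not None


if __name__ == "__main__":
    for raw in [" ", "000-000-0000", "905\u2013615\u20133200", "905-615-3200 ext. 2"]:
        cleaned = clean_phone(raw)
        print(repr(raw), "->", cleaned, cleaned is not None and is_standard(cleaned))
```

Entries that survive `clean_phone` but fail `is_standard`, such as numbers with an extension, are left unchanged so they can be reviewed rather than silently dropped.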
Fax
Categorical
| Distinct | 15752 |
|---|---|
| Distinct (%) | 20.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 905-822-2673 | 41 |
|---|---|
| 905-361-6401 | 37 |
| 905-896-9380 | 31 |
| 905-502-6982 | 18 |
| Other values (15747) |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 7.7664163 |
| Min length | 1 |
Characters and Unicode
| Total characters | 606029 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 4752 |
|---|---|
| Unique (%) | 6.1% |
Sample
| 1st row | 905-795-8988 |
|---|---|
| 2nd row | 905-795-8775 |
| 3rd row | 905-795-8775 |
| 4th row | 905-564-7395 |
| 5th row | 905-564-5003 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 29473 | 37.8% |
| 905-822-2673 | 41 | 0.1% |
| 905-361-6401 | 37 | < 0.1% |
| 905-896-9380 | 31 | < 0.1% |
| 905-502-6982 | 18 | < 0.1% |
| 905-625-4815 | 17 | < 0.1% |
| 905-542-0987 | 16 | < 0.1% |
| 905-607-9204 | 16 | < 0.1% |
| 905-625-8815 | 15 | < 0.1% |
| 905-403-8409 | 14 | < 0.1% |
| Other values (15742) | 48354 | 62.0% |
Words
| Value | Count | Frequency (%) |
| 905-822-2673 | 41 | 0.1% |
| 905-361-6401 | 37 | 0.1% |
| 905-896-9380 | 31 | 0.1% |
| 905-502-6982 | 18 | < 0.1% |
| 905-625-4815 | 17 | < 0.1% |
| 905-542-0987 | 16 | < 0.1% |
| 905-607-9204 | 16 | < 0.1% |
| 905-625-8815 | 15 | < 0.1% |
| 905-403-8409 | 14 | < 0.1% |
| 905-625-8245 | 13 | < 0.1% |
| Other values (15742) | 48342 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 90675 | |
| 0 | 79738 | |
| 5 | 78040 | |
| 9 | 75509 | |
| 6 | 47327 | |
| 2 | 44185 | |
| 8 | 39652 | |
| 7 | 37892 | |
| 1 | 30365 | 5.0% |
| (space) | 29474 | 4.9% |
| Other values (2) | 53172 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 485880 | |
| Dash Punctuation | 90675 | 15.0% |
| Space Separator | 29474 | 4.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 79738 | |
| 5 | 78040 | |
| 9 | 75509 | |
| 6 | 47327 | |
| 2 | 44185 | |
| 8 | 39652 | |
| 7 | 37892 | |
| 1 | 30365 | 6.2% |
| 4 | 27785 | 5.7% |
| 3 | 25387 | 5.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 90675 |
Space Separator
| Value | Count | Frequency (%) |
| 29474 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 606029 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 90675 | |
| 0 | 79738 | |
| 5 | 78040 | |
| 9 | 75509 | |
| 6 | 47327 | |
| 2 | 44185 | |
| 8 | 39652 | |
| 7 | 37892 | |
| 1 | 30365 | 5.0% |
| (space) | 29474 | 4.9% |
| Other values (2) | 53172 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 606029 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 90675 | |
| 0 | 79738 | |
| 5 | 78040 | |
| 9 | 75509 | |
| 6 | 47327 | |
| 2 | 44185 | |
| 8 | 39652 | |
| 7 | 37892 | |
| 1 | 30365 | 5.0% |
| (space) | 29474 | 4.9% |
| Other values (2) | 53172 |
TollFree
Categorical
| Distinct | 4117 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 1-800-769-2511 | 32 |
|---|---|
| 1-800-465-2422 | 32 |
| 1-800-472-6842 | 23 |
| 1-877-777-8672 | 16 |
| Other values (4112) |
Length
| Max length | 16 |
|---|---|
| Median length | 1 |
| Mean length | 2.8538933 |
| Min length | 1 |
Characters and Unicode
| Total characters | 222695 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 1434 |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 1-800-668-1101 |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| (blank) | 66596 | 85.3% |
| 1-800-769-2511 | 32 | < 0.1% |
| 1-800-465-2422 | 32 | < 0.1% |
| 1-800-472-6842 | 23 | < 0.1% |
| 1-877-777-8672 | 16 | < 0.1% |
| 1-877-849-3637 | 16 | < 0.1% |
| 1-866-567-8888 | 13 | < 0.1% |
| 1-800-668-0414 | 10 | < 0.1% |
| 1-800-956-9543 | 10 | < 0.1% |
| 1-866-829-9433 | 10 | < 0.1% |
| Other values (4107) | 11274 | 14.4% |
Words
| Value | Count | Frequency (%) |
| 1-800-769-2511 | 32 | 0.3% |
| 1-800-465-2422 | 32 | 0.3% |
| 1-800-472-6842 | 23 | 0.2% |
| 1-877-777-8672 | 16 | 0.1% |
| 1-877-849-3637 | 16 | 0.1% |
| 1-866-567-8888 | 13 | 0.1% |
| 1-877-526-6639 | 10 | 0.1% |
| 1-800-254-0778 | 10 | 0.1% |
| 1-800-563-4327 | 10 | 0.1% |
| 1-866-829-9433 | 10 | 0.1% |
| Other values (4111) | 11269 |
Most occurring characters
| Value | Count | Frequency (%) |
| 66601 | ||
| - | 31297 | |
| 8 | 24221 | 10.9% |
| 1 | 16130 | 7.2% |
| 0 | 14466 | 6.5% |
| 6 | 14461 | 6.5% |
| 7 | 12782 | 5.7% |
| 5 | 9818 | 4.4% |
| 2 | 9799 | 4.4% |
| 3 | 8526 | 3.8% |
| Other values (5) | 14594 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 124793 | |
| Space Separator | 66601 | |
| Dash Punctuation | 31299 | 14.1% |
| Lowercase Letter | 1 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 24221 | |
| 1 | 16130 | |
| 0 | 14466 | |
| 6 | 14461 | |
| 7 | 12782 | |
| 5 | 9818 | |
| 2 | 9799 | |
| 3 | 8526 | 6.8% |
| 4 | 7930 | 6.4% |
| 9 | 6660 | 5.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 31297 | |
| – | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 66601 |
Lowercase Letter
| Value | Count | Frequency (%) |
| x | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 222694 | |
| Latin | 1 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 66601 | ||
| - | 31297 | |
| 8 | 24221 | 10.9% |
| 1 | 16130 | 7.2% |
| 0 | 14466 | 6.5% |
| 6 | 14461 | 6.5% |
| 7 | 12782 | 5.7% |
| 5 | 9818 | 4.4% |
| 2 | 9799 | 4.4% |
| 3 | 8526 | 3.8% |
| Other values (4) | 14593 | 6.6% |
Latin
| Value | Count | Frequency (%) |
| x | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 222693 | |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 66601 | ||
| - | 31297 | |
| 8 | 24221 | 10.9% |
| 1 | 16130 | 7.2% |
| 0 | 14466 | 6.5% |
| 6 | 14461 | 6.5% |
| 7 | 12782 | 5.7% |
| 5 | 9818 | 4.4% |
| 2 | 9799 | 4.4% |
| 3 | 8526 | 3.8% |
| Other values (4) | 14592 | 6.6% |
Punctuation
| Value | Count | Frequency (%) |
| – | 2 |
EMail
Categorical
| Distinct | 15058 |
|---|---|
| Distinct (%) | 19.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| info@publicstoragecanada.com | 21 |
|---|---|
| info@taxwide.com | 20 |
| info@ucmas.ca | 13 |
| info@mississaugaschoolofmusic.ca | 13 |
| Other values (15053) |
Length
| Max length | 97 |
|---|---|
| Median length | 55 |
| Mean length | 14.085132 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1099091 |
|---|---|
| Distinct characters | 78 |
| Distinct categories | 11 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 3361 |
|---|---|
| Unique (%) | 4.3% |
Sample
| 1st row | lfinch@golftrendsinc.com |
|---|---|
| 2nd row | prepress@apexgraphics.com |
| 3rd row | |
| 4th row | info@printmedia.ca |
| 5th row | shsieh@swrltd.com |
Common Values
| Value | Count | Frequency (%) |
| (blank) | 30506 | 39.1% |
| info@publicstoragecanada.com | 21 | < 0.1% |
| info@taxwide.com | 20 | < 0.1% |
| info@ucmas.ca | 13 | < 0.1% |
| info@mississaugaschoolofmusic.ca | 13 | < 0.1% |
| cyclone@cyclonemfg.com | 12 | < 0.1% |
| millertrailers@rogers.com | 12 | < 0.1% |
| info@realfruitbubbletea.com | 12 | < 0.1% |
| info@akaloptical.com | 12 | < 0.1% |
| ktc.ca.info@kapsch.net | 12 | < 0.1% |
| Other values (15048) | 47399 | 60.7% |
Length (plot omitted)
Words
| Value | Count | Frequency (%) |
| info@publicstoragecanada.com | 21 | < 0.1% |
| info@taxwide.com | 20 | < 0.1% |
| info@ucmas.ca | 13 | < 0.1% |
| info@mississaugaschoolofmusic.ca | 13 | < 0.1% |
| cyclone@cyclonemfg.com | 12 | < 0.1% |
| millertrailers@rogers.com | 12 | < 0.1% |
| info@realfruitbubbletea.com | 12 | < 0.1% |
| info@akaloptical.com | 12 | < 0.1% |
| ktc.ca.info@kapsch.net | 12 | < 0.1% |
| insure@all-risks.com | 11 | < 0.1% |
| Other values (15012) | 47482 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 99086 | 9.0% |
| a | 97080 | 8.8% |
| c | 83214 | 7.6% |
| i | 74076 | 6.7% |
| e | 72811 | 6.6% |
| n | 63754 | 5.8% |
| m | 63062 | 5.7% |
| s | 58432 | 5.3% |
| r | 53466 | 4.9% |
| . | 51798 | 4.7% |
| Other values (68) | 382312 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 953466 | |
| Other Punctuation | 99332 | 9.0% |
| Space Separator | 30707 | 2.8% |
| Decimal Number | 11022 | 1.0% |
| Uppercase Letter | 1925 | 0.2% |
| Dash Punctuation | 1864 | 0.2% |
| Connector Punctuation | 766 | 0.1% |
| Control | 4 | < 0.1% |
| Modifier Symbol | 3 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 99086 | |
| a | 97080 | |
| c | 83214 | 8.7% |
| i | 74076 | 7.8% |
| e | 72811 | 7.6% |
| n | 63754 | 6.7% |
| m | 63062 | 6.6% |
| s | 58432 | 6.1% |
| r | 53466 | 5.6% |
| t | 50375 | 5.3% |
| Other values (16) | 238110 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 281 | |
| S | 211 | 11.0% |
| M | 203 | 10.5% |
| C | 133 | 6.9% |
| A | 122 | 6.3% |
| D | 96 | 5.0% |
| P | 88 | 4.6% |
| B | 81 | 4.2% |
| J | 79 | 4.1% |
| T | 77 | 4.0% |
| Other values (16) | 554 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1932 | |
| 0 | 1824 | |
| 2 | 1678 | |
| 3 | 975 | |
| 5 | 873 | |
| 4 | 804 | |
| 7 | 764 | 6.9% |
| 6 | 755 | 6.8% |
| 8 | 753 | 6.8% |
| 9 | 664 | 6.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 51798 | |
| @ | 47451 | |
| / | 35 | < 0.1% |
| & | 18 | < 0.1% |
| , | 8 | < 0.1% |
| ' | 7 | < 0.1% |
| # | 5 | < 0.1% |
| : | 5 | < 0.1% |
| · | 5 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 30707 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1864 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 766 |
Control
| Value | Count | Frequency (%) |
| (control character) | 4 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 3 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 955391 | |
| Common | 143700 | 13.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 99086 | |
| a | 97080 | |
| c | 83214 | 8.7% |
| i | 74076 | 7.8% |
| e | 72811 | 7.6% |
| n | 63754 | 6.7% |
| m | 63062 | 6.6% |
| s | 58432 | 6.1% |
| r | 53466 | 5.6% |
| t | 50375 | 5.3% |
| Other values (42) | 240035 |
Common
| Value | Count | Frequency (%) |
| . | 51798 | |
| @ | 47451 | |
| (space) | 30707 | |
| 1 | 1932 | 1.3% |
| - | 1864 | 1.3% |
| 0 | 1824 | 1.3% |
| 2 | 1678 | 1.2% |
| 3 | 975 | 0.7% |
| 5 | 873 | 0.6% |
| 4 | 804 | 0.6% |
| Other values (16) | 3794 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1099085 | |
| None | 5 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 99086 | 9.0% |
| a | 97080 | 8.8% |
| c | 83214 | 7.6% |
| i | 74076 | 6.7% |
| e | 72811 | 6.6% |
| n | 63754 | 5.8% |
| m | 63062 | 5.7% |
| s | 58432 | 5.3% |
| r | 53466 | 4.9% |
| . | 51798 | 4.7% |
| Other values (66) | 382306 |
None
| Value | Count | Frequency (%) |
| · | 5 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
WebAddress
Categorical
| Distinct | 14200 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| www.dpcdsb.org | 221 |
|---|---|
| www.subway.com | 215 |
| www.timhortons.com | 211 |
| www.petro-canada.ca | 115 |
| Other values (14195) | 56003 |
Length
| Max length | 84 |
|---|---|
| Median length | 50 |
| Mean length | 14.525797 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1133477 |
|---|---|
| Distinct characters | 80 |
| Distinct categories | 12 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 2033 |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | www.golftrendsinc.com |
|---|---|
| 2nd row | www.apexgraphics.com |
| 3rd row | |
| 4th row | www.printmedia.ca |
| 5th row | www.swrltd.com |
Common Values
| Value | Count | Frequency (%) |
| (blank) | 21267 | 27.3% |
| www.dpcdsb.org | 221 | 0.3% |
| www.subway.com | 215 | 0.3% |
| www.timhortons.com | 211 | 0.3% |
| www.petro-canada.ca | 115 | 0.1% |
| www.shoppersdrugmart.ca | 107 | 0.1% |
| www.mississauga.ca/portal/residents/fire | 95 | 0.1% |
| www.td.com | 91 | 0.1% |
| www.dollarama.com | 88 | 0.1% |
| www.shell.ca | 84 | 0.1% |
| Other values (14190) | 55538 | 71.2% |
Length (plot omitted)
Words
| Value | Count | Frequency (%) |
| www.dpcdsb.org | 221 | 0.4% |
| www.subway.com | 215 | 0.4% |
| www.timhortons.com | 211 | 0.4% |
| www.petro-canada.ca | 115 | 0.2% |
| www.shoppersdrugmart.ca | 107 | 0.2% |
| www.mississauga.ca/portal/residents/fire | 95 | 0.2% |
| www.td.com | 91 | 0.2% |
| www.dollarama.com | 88 | 0.2% |
| www.shell.ca | 84 | 0.1% |
| www.starbucks.ca | 83 | 0.1% |
| Other values (14093) | 55516 |
Most occurring characters
| Value | Count | Frequency (%) |
| w | 178470 | |
| . | 114796 | 10.1% |
| c | 90000 | 7.9% |
| a | 87304 | 7.7% |
| o | 81312 | 7.2% |
| e | 65391 | 5.8% |
| m | 55954 | 4.9% |
| s | 50675 | 4.5% |
| i | 50384 | 4.4% |
| r | 49832 | 4.4% |
| Other values (70) | 309359 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 989738 | |
| Other Punctuation | 116174 | 10.2% |
| Space Separator | 21324 | 1.9% |
| Dash Punctuation | 2684 | 0.2% |
| Decimal Number | 2467 | 0.2% |
| Uppercase Letter | 1007 | 0.1% |
| Math Symbol | 52 | < 0.1% |
| Control | 10 | < 0.1% |
| Connector Punctuation | 10 | < 0.1% |
| Modifier Symbol | 8 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 178470 | |
| c | 90000 | 9.1% |
| a | 87304 | 8.8% |
| o | 81312 | 8.2% |
| e | 65391 | 6.6% |
| m | 55954 | 5.7% |
| s | 50675 | 5.1% |
| i | 50384 | 5.1% |
| r | 49832 | 5.0% |
| t | 47223 | 4.8% |
| Other values (17) | 233193 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 108 | 10.7% |
| W | 105 | 10.4% |
| S | 71 | 7.1% |
| M | 70 | 7.0% |
| T | 59 | 5.9% |
| A | 57 | 5.7% |
| L | 57 | 5.7% |
| F | 52 | 5.2% |
| R | 51 | 5.1% |
| P | 41 | 4.1% |
| Other values (16) | 336 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 551 | |
| 2 | 475 | |
| 0 | 349 | |
| 4 | 324 | |
| 3 | 230 | |
| 6 | 129 | 5.2% |
| 8 | 119 | 4.8% |
| 9 | 119 | 4.8% |
| 5 | 101 | 4.1% |
| 7 | 70 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 114796 | |
| / | 1297 | 1.1% |
| @ | 47 | < 0.1% |
| & | 18 | < 0.1% |
| \ | 6 | < 0.1% |
| , | 4 | < 0.1% |
| : | 3 | < 0.1% |
| ' | 2 | < 0.1% |
| · | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 21324 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2684 |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 52 |
Control
| Value | Count | Frequency (%) |
| (control character) | 10 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 10 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 8 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 990745 | |
| Common | 142732 | 12.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| w | 178470 | |
| c | 90000 | 9.1% |
| a | 87304 | 8.8% |
| o | 81312 | 8.2% |
| e | 65391 | 6.6% |
| m | 55954 | 5.6% |
| s | 50675 | 5.1% |
| i | 50384 | 5.1% |
| r | 49832 | 5.0% |
| t | 47223 | 4.8% |
| Other values (43) | 234200 |
Common
| Value | Count | Frequency (%) |
| . | 114796 | |
| (space) | 21324 | 14.9% |
| - | 2684 | 1.9% |
| / | 1297 | 0.9% |
| 1 | 551 | 0.4% |
| 2 | 475 | 0.3% |
| 0 | 349 | 0.2% |
| 4 | 324 | 0.2% |
| 3 | 230 | 0.2% |
| 6 | 129 | 0.1% |
| Other values (17) | 573 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1133473 | |
| None | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| w | 178470 | |
| . | 114796 | 10.1% |
| c | 90000 | 7.9% |
| a | 87304 | 7.7% |
| o | 81312 | 7.2% |
| e | 65391 | 5.8% |
| m | 55954 | 4.9% |
| s | 50675 | 4.5% |
| i | 50384 | 4.4% |
| r | 49832 | 4.4% |
| Other values (68) | 309355 |
None
| Value | Count | Frequency (%) |
| é | 3 | |
| · | 1 | 25.0% |
EmplRange
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 609.8 KiB |
| 1 to 4 | 37311 |
|---|---|
| 5 to 9 | 16050 |
| 10 to 19 | 10510 |
| 20 to 49 | 8120 |
| 50 to 99 | 3313 |
| Other values (4) | 2727 |
Length
| Max length | 10 |
|---|---|
| Median length | 6 |
| Mean length | 6.6960567 |
| Min length | 5 |
Characters and Unicode
| Total characters | 522500 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 10 to 19 |
|---|---|
| 2nd row | 20 to 49 |
| 3rd row | 50 to 99 |
| 4th row | 1 to 4 |
| 5th row | 5 to 9 |
Common Values
| Value | Count | Frequency (%) |
| 1 to 4 | 37311 | 47.8% |
| 5 to 9 | 16050 | 20.6% |
| 10 to 19 | 10510 | 13.5% |
| 20 to 49 | 8120 | 10.4% |
| 50 to 99 | 3313 | 4.2% |
| 100 to 299 | 2149 | 2.8% |
| 300 to 499 | 318 | 0.4% |
| 500 to 999 | 164 | 0.2% |
| 1000+ | 96 | 0.1% |
| (Missing) | 1 | < 0.1% |
Length (plot omitted)
Common Values (plot omitted)
Words
| Value | Count | Frequency (%) |
| to | 77935 | |
| 1 | 37311 | |
| 4 | 37311 | |
| 5 | 16050 | 6.9% |
| 9 | 16050 | 6.9% |
| 10 | 10510 | 4.5% |
| 19 | 10510 | 4.5% |
| 20 | 8120 | 3.5% |
| 49 | 8120 | 3.5% |
| 99 | 3313 | 1.4% |
| Other values (8) | 8671 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 155870 | |
| t | 77935 | |
| o | 77935 | |
| 1 | 60576 | 11.6% |
| 9 | 46732 | 8.9% |
| 4 | 45749 | 8.8% |
| 0 | 27493 | 5.3% |
| 5 | 19527 | 3.7% |
| 2 | 10269 | 2.0% |
| 3 | 318 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 210664 | |
| Space Separator | 155870 | |
| Lowercase Letter | 155870 | |
| Math Symbol | 96 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 60576 | |
| 9 | 46732 | |
| 4 | 45749 | |
| 0 | 27493 | |
| 5 | 19527 | 9.3% |
| 2 | 10269 | 4.9% |
| 3 | 318 | 0.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 77935 | |
| o | 77935 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 155870 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 96 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 366630 | |
| Latin | 155870 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 155870 | |
| 1 | 60576 | 16.5% |
| 9 | 46732 | 12.7% |
| 4 | 45749 | 12.5% |
| 0 | 27493 | 7.5% |
| 5 | 19527 | 5.3% |
| 2 | 10269 | 2.8% |
| 3 | 318 | 0.1% |
| + | 96 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| t | 77935 | |
| o | 77935 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 522500 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 155870 | |
| t | 77935 | |
| o | 77935 | |
| 1 | 60576 | 11.6% |
| 9 | 46732 | 8.9% |
| 4 | 45749 | 8.8% |
| 0 | 27493 | 5.3% |
| 5 | 19527 | 3.7% |
| 2 | 10269 | 2.0% |
| 3 | 318 | 0.1% |
EmplUpdate
Categorical
| Distinct | 433 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 15002 |
| Missing (%) | 19.2% |
| Memory size | 609.8 KiB |
| 2017/11/08 00:00:00+00 | 11037 |
|---|---|
| 2018/12/30 00:00:00+00 | 9918 |
| 2017/11/09 00:00:00+00 | 8042 |
| 2015/10/31 00:00:00+00 | 4560 |
| 2016/10/31 00:00:00+00 | 4499 |
| Other values (428) | 24974 |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 1386660 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 111 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 2015/10/31 00:00:00+00 |
|---|---|
| 2nd row | 2016/10/31 00:00:00+00 |
| 3rd row | 2015/10/31 00:00:00+00 |
| 4th row | 2015/10/31 00:00:00+00 |
| 5th row | 2015/10/31 00:00:00+00 |
Common Values
| Value | Count | Frequency (%) |
| 2017/11/08 00:00:00+00 | 11037 | 14.1% |
| 2018/12/30 00:00:00+00 | 9918 | 12.7% |
| 2017/11/09 00:00:00+00 | 8042 | 10.3% |
| 2015/10/31 00:00:00+00 | 4560 | 5.8% |
| 2016/10/31 00:00:00+00 | 4499 | 5.8% |
| 2019/12/12 00:00:00+00 | 3326 | 4.3% |
| 2019/09/19 00:00:00+00 | 2718 | 3.5% |
| 2018/09/30 00:00:00+00 | 849 | 1.1% |
| 2017/06/08 00:00:00+00 | 726 | 0.9% |
| 2017/05/24 00:00:00+00 | 646 | 0.8% |
| Other values (423) | 16709 | 21.4% |
| (Missing) | 15002 | 19.2% |
Length (plot omitted)
Words
| Value | Count | Frequency (%) |
| 00:00:00+00 | 63030 | |
| 2017/11/08 | 11037 | 8.8% |
| 2018/12/30 | 9918 | 7.9% |
| 2017/11/09 | 8042 | 6.4% |
| 2015/10/31 | 4560 | 3.6% |
| 2016/10/31 | 4499 | 3.6% |
| 2019/12/12 | 3326 | 2.6% |
| 2019/09/19 | 2718 | 2.2% |
| 2018/09/30 | 849 | 0.7% |
| 2017/06/08 | 726 | 0.6% |
| Other values (424) | 17355 | 13.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 635253 | |
| 1 | 147277 | 10.6% |
| / | 126060 | 9.1% |
| : | 126060 | 9.1% |
| 2 | 85561 | 6.2% |
| (space) | 63030 | 4.5% |
| + | 63030 | 4.5% |
| 7 | 33726 | 2.4% |
| 8 | 28228 | 2.0% |
| 9 | 23888 | 1.7% |
| Other values (4) | 54547 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1008480 | |
| Other Punctuation | 252120 | 18.2% |
| Space Separator | 63030 | 4.5% |
| Math Symbol | 63030 | 4.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 635253 | |
| 1 | 147277 | 14.6% |
| 2 | 85561 | 8.5% |
| 7 | 33726 | 3.3% |
| 8 | 28228 | 2.8% |
| 9 | 23888 | 2.4% |
| 3 | 22832 | 2.3% |
| 5 | 16305 | 1.6% |
| 6 | 13078 | 1.3% |
| 4 | 2332 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 126060 | |
| : | 126060 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 63030 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 63030 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1386660 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 635253 | |
| 1 | 147277 | 10.6% |
| / | 126060 | 9.1% |
| : | 126060 | 9.1% |
| 2 | 85561 | 6.2% |
| (space) | 63030 | 4.5% |
| + | 63030 | 4.5% |
| 7 | 33726 | 2.4% |
| 8 | 28228 | 2.0% |
| 9 | 23888 | 1.7% |
| Other values (4) | 54547 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1386660 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 635253 | |
| 1 | 147277 | 10.6% |
| / | 126060 | 9.1% |
| : | 126060 | 9.1% |
| 2 | 85561 | 6.2% |
| (space) | 63030 | 4.5% |
| + | 63030 | 4.5% |
| 7 | 33726 | 2.4% |
| 8 | 28228 | 2.0% |
| 9 | 23888 | 1.7% |
| Other values (4) | 54547 | 3.9% |
| Distinct | 29 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 63430 |
| Missing (%) | 81.3% |
| Memory size | 609.8 KiB |
| Financial Services | 870 |
|---|---|
| Food and Beverage | 444 |
| Automotive | 329 |
| Life Sciences | 263 |
| Other values (24) | 313 |
Length
| Max length | 57 |
|---|---|
| Median length | 1 |
| Mean length | 3.3009177 |
| Min length | 1 |
Characters and Unicode
| Total characters | 48200 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (blank) | 12383 | 15.9% |
| Financial Services | 870 | 1.1% |
| Food and Beverage | 444 | 0.6% |
| Automotive | 329 | 0.4% |
| Life Sciences | 263 | 0.3% |
| Aerospace | 132 | 0.2% |
| Automotive,Aerospace | 55 | 0.1% |
| Cleantech | 24 | < 0.1% |
| Automotive,Food and Beverage | 24 | < 0.1% |
| Automotive,Aerospace,Food and Beverage | 15 | < 0.1% |
| Other values (19) | 63 | 0.1% |
| (Missing) | 63430 | 81.3% |
Length (plot omitted)
Words
| Value | Count | Frequency (%) |
| services | 884 | |
| financial | 870 | |
| and | 528 | |
| beverage | 514 | |
| food | 452 | |
| automotive | 329 | 7.4% |
| life | 281 | 6.3% |
| sciences | 265 | 5.9% |
| aerospace | 132 | 3.0% |
| automotive,aerospace | 55 | 1.2% |
| Other values (15) | 145 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 14623 | |
| e | 5221 | 10.8% |
| i | 3691 | 7.7% |
| a | 3091 | 6.4% |
| c | 2627 | 5.5% |
| n | 2626 | 5.4% |
| o | 2183 | 4.5% |
| v | 1859 | 3.9% |
| r | 1645 | 3.4% |
| s | 1413 | 2.9% |
| Other values (16) | 9221 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29243 | |
| Space Separator | 14623 | |
| Uppercase Letter | 4130 | 8.6% |
| Other Punctuation | 204 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5221 | |
| i | 3691 | |
| a | 3091 | |
| c | 2627 | |
| n | 2626 | |
| o | 2183 | |
| v | 1859 | 6.4% |
| r | 1645 | 5.6% |
| s | 1413 | 4.8% |
| d | 1056 | 3.6% |
| Other values (8) | 3831 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1412 | |
| S | 1180 | |
| A | 680 | |
| B | 528 | 12.8% |
| L | 296 | 7.2% |
| C | 34 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 14623 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 204 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33373 | |
| Common | 14827 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5221 | |
| i | 3691 | |
| a | 3091 | |
| c | 2627 | 7.9% |
| n | 2626 | 7.9% |
| o | 2183 | 6.5% |
| v | 1859 | 5.6% |
| r | 1645 | 4.9% |
| s | 1413 | 4.2% |
| F | 1412 | 4.2% |
| Other values (14) | 7605 |
Common
| Value | Count | Frequency (%) |
| (space) | 14623 | |
| , | 204 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 14623 | |
| e | 5221 | 10.8% |
| i | 3691 | 7.7% |
| a | 3091 | 6.4% |
| c | 2627 | 5.5% |
| n | 2626 | 5.4% |
| o | 2183 | 4.5% |
| v | 1859 | 3.9% |
| r | 1645 | 3.4% |
| s | 1413 | 2.9% |
| Other values (16) | 9221 |
| Distinct | 4685 |
|---|---|
| Distinct (%) | 15.4% |
| Missing | 47693 |
| Missing (%) | 61.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 608659.35 |
| Minimum | 596627.93 |
|---|---|
| Maximum | 616985.06 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 596627.93 |
|---|---|
| 5-th percentile | 601465.65 |
| Q1 | 606483.02 |
| median | 608923.98 |
| Q3 | 611391.08 |
| 95-th percentile | 614814.86 |
| Maximum | 616985.06 |
| Range | 20357.121 |
| Interquartile range (IQR) | 4908.0572 |
Descriptive statistics
| Standard deviation | 3852.0245 |
|---|---|
| Coefficient of variation (CV) | 0.0063287033 |
| Kurtosis | -0.066028416 |
| Mean | 608659.35 |
| Median Absolute Deviation (MAD) | 2462.861 |
| Skewness | -0.41317914 |
| Sum | 1.8466116 × 10¹⁰ |
| Variance | 14838093 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 609556.5032 | 367 | 0.5% |
| 612552.1674 | 255 | 0.3% |
| 604009.418 | 228 | 0.3% |
| 609657.7584 | 205 | 0.3% |
| 615480.8966 | 178 | 0.2% |
| 604848.575 | 110 | 0.1% |
| 608539.0792 | 107 | 0.1% |
| 612581.1624 | 106 | 0.1% |
| 608826.735 | 100 | 0.1% |
| 600161.54 | 100 | 0.1% |
| Other values (4675) | 28583 | 36.6% |
| (Missing) | 47693 | 61.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 596627.9342 | 2 | < 0.1% |
| 596752.9696 | 2 | < 0.1% |
| 597309.0542 | 3 | < 0.1% |
| 597312.632 | 2 | < 0.1% |
| 597772.3526 | 49 | |
| 597782.4012 | 2 | < 0.1% |
| 597812.404 | 2 | < 0.1% |
| 597933.2448 | 13 | < 0.1% |
| 597963.9396 | 25 | |
| 598104.1884 | 24 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 616985.0552 | 9 | |
| 616917.8604 | 1 | < 0.1% |
| 616879.86 | 1 | < 0.1% |
| 616836.9092 | 2 | < 0.1% |
| 616794.193 | 2 | < 0.1% |
| 616756.05 | 2 | < 0.1% |
| 616706.7026 | 2 | < 0.1% |
| 616695.363 | 4 | |
| 616668.1574 | 2 | < 0.1% |
| 616652.9546 | 1 | < 0.1% |
| Distinct | 7965 |
|---|---|
| Distinct (%) | 26.3% |
| Missing | 47693 |
| Missing (%) | 61.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4829613.5 |
| Minimum | 4815546.6 |
|---|---|
| Maximum | 4843107.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 4815546.6 |
|---|---|
| 5-th percentile | 4819703.7 |
| Q1 | 4825956.9 |
| median | 4829277.7 |
| Q3 | 4833786.4 |
| 95-th percentile | 4839313.8 |
| Maximum | 4843107.8 |
| Range | 27561.199 |
| Interquartile range (IQR) | 7829.5471 |
Descriptive statistics
| Standard deviation | 5660.9074 |
|---|---|
| Coefficient of variation (CV) | 0.0011721243 |
| Kurtosis | -0.58959863 |
| Mean | 4829613.5 |
| Median Absolute Deviation (MAD) | 3923.2538 |
| Skewness | -0.0065033252 |
| Sum | 1.4652564 × 10¹¹ |
| Variance | 32045872 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 4837278.362 | 255 | 0.3% |
| 4827620.949 | 185 | 0.2% |
| 4827620.949 | 182 | 0.2% |
| 4823628.592 | 115 | 0.1% |
| 4823628.592 | 113 | 0.1% |
| 4841687.188 | 107 | 0.1% |
| 4841687.188 | 98 | 0.1% |
| 4827728.859 | 91 | 0.1% |
| 4827728.859 | 87 | 0.1% |
| 4822083.931 | 86 | 0.1% |
| Other values (7955) | 29020 | 37.2% |
| (Missing) | 47693 | 61.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 4815546.641 | 1 | |
| 4815609.051 | 1 | |
| 4815609.051 | 1 | |
| 4816109.607 | 2 | |
| 4816333.508 | 2 | |
| 4816381.801 | 2 | |
| 4816381.801 | 2 | |
| 4816389.354 | 1 | |
| 4816389.354 | 1 | |
| 4816462.515 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 4843107.84 | 9 | |
| 4843107.84 | 10 | |
| 4843040.829 | 1 | < 0.1% |
| 4843040.829 | 1 | < 0.1% |
| 4842998.68 | 1 | < 0.1% |
| 4842998.68 | 1 | < 0.1% |
| 4842855.077 | 1 | < 0.1% |
| 4842855.077 | 1 | < 0.1% |
| 4842717.945 | 1 | < 0.1% |
| 4842717.945 | 1 | < 0.1% |
Year
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 609.8 KiB |
| 2019 | 16518 |
|---|---|
| 2018 | 16350 |
| 2017 | 15737 |
| 2021 | 14825 |
| 2016 | 14602 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 312128 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2016 |
|---|---|
| 2nd row | 2016 |
| 3rd row | 2016 |
| 4th row | 2016 |
| 5th row | 2016 |
Common Values
| Value | Count | Frequency (%) |
| 2019 | 16518 | 21.2% |
| 2018 | 16350 | 21.0% |
| 2017 | 15737 | 20.2% |
| 2021 | 14825 | 19.0% |
| 2016 | 14602 | 18.7% |
Length (plot omitted)
Common Values (plot omitted)
Words
| Value | Count | Frequency (%) |
| 2019 | 16518 | 21.2% |
| 2018 | 16350 | 21.0% |
| 2017 | 15737 | 20.2% |
| 2021 | 14825 | 19.0% |
| 2016 | 14602 | 18.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 92857 | |
| 0 | 78032 | |
| 1 | 78032 | |
| 9 | 16518 | 5.3% |
| 8 | 16350 | 5.2% |
| 7 | 15737 | 5.0% |
| 6 | 14602 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 312128 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 92857 | |
| 0 | 78032 | |
| 1 | 78032 | |
| 9 | 16518 | 5.3% |
| 8 | 16350 | 5.2% |
| 7 | 15737 | 5.0% |
| 6 | 14602 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 312128 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 92857 | |
| 0 | 78032 | |
| 1 | 78032 | |
| 9 | 16518 | 5.3% |
| 8 | 16350 | 5.2% |
| 7 | 15737 | 5.0% |
| 6 | 14602 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 312128 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 92857 | |
| 0 | 78032 | |
| 1 | 78032 | |
| 9 | 16518 | 5.3% |
| 8 | 16350 | 5.2% |
| 7 | 15737 | 5.0% |
| 6 | 14602 | 4.7% |
| Distinct | 4961 |
|---|---|
| Distinct (%) | 10.4% |
| Missing | 30339 |
| Missing (%) | 38.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11122872 |
| Minimum | 32500 |
|---|---|
| Maximum | 32656400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 32500 |
|---|---|
| 5-th percentile | 1878100 |
| Q1 | 5158600 |
| median | 10172700 |
| Q3 | 14774800 |
| 95-th percentile | 28577700 |
| Maximum | 32656400 |
| Range | 32623900 |
| Interquartile range (IQR) | 9616200 |
Descriptive statistics
| Standard deviation | 7579367.8 |
|---|---|
| Coefficient of variation (CV) | 0.68142186 |
| Kurtosis | 0.64239942 |
| Mean | 11122872 |
| Median Absolute Deviation (MAD) | 4630200 |
| Skewness | 1.0445894 |
| Sum | 5.3048311 × 10¹¹ |
| Variance | 5.7446816 × 10¹³ |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 6068300 | 586 | 0.8% |
| 31141506 | 414 | 0.5% |
| 4407700 | 328 | 0.4% |
| 9663800 | 287 | 0.4% |
| 12876900 | 216 | 0.3% |
| 24265600 | 190 | 0.2% |
| 14804200 | 186 | 0.2% |
| 31381800 | 177 | 0.2% |
| 17704200 | 161 | 0.2% |
| 10173700 | 147 | 0.2% |
| Other values (4951) | 45001 | 57.7% |
| (Missing) | 30339 | 38.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 32500 | 3 | < 0.1% |
| 37200 | 10 | < 0.1% |
| 37300 | 2 | < 0.1% |
| 37400 | 33 | |
| 38100 | 2 | < 0.1% |
| 38300 | 9 | < 0.1% |
| 38400 | 14 | |
| 38500 | 2 | < 0.1% |
| 38600 | 13 | < 0.1% |
| 38700 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 32656400 | 1 | < 0.1% |
| 32646400 | 44 | |
| 32551400 | 1 | < 0.1% |
| 32526400 | 2 | < 0.1% |
| 32476400 | 11 | < 0.1% |
| 32442000 | 5 | < 0.1% |
| 32441600 | 2 | < 0.1% |
| 32436400 | 25 | |
| 32431500 | 43 | |
| 32371800 | 1 | < 0.1% |
| Distinct | 56 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 61682 |
| Missing (%) | 79.0% |
| Memory size | 609.8 KiB |
| Northeast EA (West) | 4700 |
|---|---|
| Dixie EA | 1048 |
| Gateway EA (East) | 1034 |
| Meadowvale Business Park CC | 998 |
| Western Business Park EA | 847 |
| Other values (51) | 7723 |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 16.546361 |
| Min length | 7 |
Characters and Unicode
| Total characters | 270533 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Cooksville NHD (East) |
|---|---|
| 2nd row | Rathwood NHD |
| 3rd row | Cooksville NHD (East) |
| 4th row | Rathwood-Applewood CN |
| 5th row | Cooksville NHD (East) |
Common Values
| Value | Count | Frequency (%) |
| Northeast EA (West) | 4700 | 6.0% |
| Dixie EA | 1048 | 1.3% |
| Gateway EA (East) | 1034 | 1.3% |
| Meadowvale Business Park CC | 998 | 1.3% |
| Western Business Park EA | 847 | 1.1% |
| DT Core | 738 | 0.9% |
| Airport CC | 507 | 0.6% |
| Northeast EA (East) | 411 | 0.5% |
| DT Cooksville | 409 | 0.5% |
| Mavis-Erindale EA | 392 | 0.5% |
| Other values (46) | 5266 | 6.7% |
| (Missing) | 61682 | 79.0% |
Length (plot omitted)
Words
| Value | Count | Frequency (%) |
| ea | 8946 | |
| northeast | 5111 | 11.3% |
| west | 5028 | 11.1% |
| nhd | 2823 | 6.2% |
| park | 2036 | 4.5% |
| east | 1943 | 4.3% |
| business | 1845 | 4.1% |
| cc | 1768 | 3.9% |
| gateway | 1473 | 3.2% |
| dt | 1329 | 2.9% |
| Other values (45) | 13071 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 29023 | 10.7% |
| e | 24540 | 9.1% |
| t | 23397 | 8.6% |
| s | 21055 | 7.8% |
| a | 17998 | 6.7% |
| r | 14002 | 5.2% |
| o | 12439 | 4.6% |
| E | 11848 | 4.4% |
| A | 10106 | 3.7% |
| i | 9697 | 3.6% |
| Other values (33) | 96428 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 162448 | |
| Uppercase Letter | 65047 | |
| Space Separator | 29023 | 10.7% |
| Open Punctuation | 6677 | 2.5% |
| Close Punctuation | 6677 | 2.5% |
| Dash Punctuation | 661 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 24540 | |
| t | 23397 | |
| s | 21055 | |
| a | 17998 | |
| r | 14002 | |
| o | 12439 | |
| i | 9697 | 6.0% |
| l | 6578 | 4.0% |
| h | 5996 | 3.7% |
| n | 5527 | 3.4% |
| Other values (11) | 21219 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 11848 | |
| A | 10106 | |
| N | 9262 | |
| C | 7359 | |
| W | 5875 | |
| D | 5200 | |
| H | 3127 | 4.8% |
| M | 2865 | 4.4% |
| P | 2537 | 3.9% |
| B | 1845 | 2.8% |
| Other values (8) | 5023 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 29023 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6677 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6677 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 661 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 227495 | |
| Common | 43038 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 24540 | 10.8% |
| t | 23397 | 10.3% |
| s | 21055 | 9.3% |
| a | 17998 | 7.9% |
| r | 14002 | 6.2% |
| o | 12439 | 5.5% |
| E | 11848 | 5.2% |
| A | 10106 | 4.4% |
| i | 9697 | 4.3% |
| N | 9262 | 4.1% |
| Other values (29) | 73151 |
Common
| Value | Count | Frequency (%) |
| (space) | 29023 | |
| ( | 6677 | 15.5% |
| ) | 6677 | 15.5% |
| - | 661 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 270533 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 29023 | 10.7% |
| e | 24540 | 9.1% |
| t | 23397 | 8.6% |
| s | 21055 | 7.8% |
| a | 17998 | 6.7% |
| r | 14002 | 5.2% |
| o | 12439 | 4.6% |
| E | 11848 | 4.4% |
| A | 10106 | 3.7% |
| i | 9697 | 3.6% |
| Other values (33) | 96428 |
CHArea
Categorical
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 46689 |
| Missing (%) | 59.8% |
| Memory size | 609.8 KiB |
| Northeast EA (West) | 8989 |
|---|---|
| Gateway EA (East) | 1975 |
| Dixie EA | 1955 |
| Meadowvale Business Park CC | 1898 |
| Western Business Park EA | 1636 |
| Other values (52) | 14890 |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 16.534633 |
| Min length | 7 |
Characters and Unicode
| Total characters | 518245 |
|---|---|
| Distinct characters | 44 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Northeast EA (West) |
|---|---|
| 2nd row | DT Core |
| 3rd row | Northeast EA (West) |
| 4th row | DT Core |
| 5th row | DT Core |
Common Values
| Value | Count | Frequency (%) |
| Northeast EA (West) | 8989 | 11.5% |
| Gateway EA (East) | 1975 | 2.5% |
| Dixie EA | 1955 | 2.5% |
| Meadowvale Business Park CC | 1898 | 2.4% |
| Western Business Park EA | 1636 | 2.1% |
| DT Core | 1477 | 1.9% |
| Airport CC | 996 | 1.3% |
| Northeast EA (East) | 804 | 1.0% |
| Mavis-Erindale EA | 784 | 1.0% |
| DT Cooksville | 724 | 0.9% |
| Other values (47) | 10105 | 12.9% |
| (Missing) | 46689 | 59.8% |
Length (plot omitted)
Words
| Value | Count | Frequency (%) |
| ea | 17070 | |
| northeast | 9793 | 11.3% |
| west | 9630 | 11.1% |
| nhd | 5337 | 6.1% |
| park | 3923 | 4.5% |
| east | 3694 | 4.2% |
| business | 3534 | 4.1% |
| cc | 3445 | 4.0% |
| gateway | 2875 | 3.3% |
| dt | 2519 | 2.9% |
| Other values (48) | 25104 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 55581 | 10.7% |
| e | 47046 | 9.1% |
| t | 44934 | 8.7% |
| s | 40159 | 7.7% |
| a | 34746 | 6.7% |
| r | 27014 | 5.2% |
| o | 23860 | 4.6% |
| E | 22566 | 4.4% |
| A | 19328 | 3.7% |
| i | 18277 | 3.5% |
| Other values (34) | 184734 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 311221 | |
| Uppercase Letter | 124590 | |
| Space Separator | 55581 | 10.7% |
| Close Punctuation | 12769 | 2.5% |
| Open Punctuation | 12769 | 2.5% |
| Dash Punctuation | 1315 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 47046 | |
| t | 44934 | |
| s | 40159 | |
| a | 34746 | |
| r | 27014 | |
| o | 23860 | |
| i | 18277 | 5.9% |
| l | 12367 | 4.0% |
| h | 11504 | 3.7% |
| n | 10684 | 3.4% |
| Other values (12) | 40630 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 22566 | |
| A | 19328 | |
| N | 17803 | |
| C | 14198 | |
| W | 11322 | |
| D | 9811 | |
| H | 5878 | 4.7% |
| M | 5574 | 4.5% |
| P | 4873 | 3.9% |
| B | 3534 | 2.8% |
| Other values (8) | 9703 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 55581 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12769 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12769 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1315 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 435811 | |
| Common | 82434 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 47046 | 10.8% |
| t | 44934 | 10.3% |
| s | 40159 | 9.2% |
| a | 34746 | 8.0% |
| r | 27014 | 6.2% |
| o | 23860 | 5.5% |
| E | 22566 | 5.2% |
| A | 19328 | 4.4% |
| i | 18277 | 4.2% |
| N | 17803 | 4.1% |
| Other values (30) | 140078 |
Common
| Value | Count | Frequency (%) |
| (space) | 55581 | |
| ) | 12769 | 15.5% |
| ( | 12769 | 15.5% |
| - | 1315 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 518245 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 55581 | 10.7% |
| e | 47046 | 9.1% |
| t | 44934 | 8.7% |
| s | 40159 | 7.7% |
| a | 34746 | 6.7% |
| r | 27014 | 5.2% |
| o | 23860 | 4.6% |
| E | 22566 | 4.4% |
| A | 19328 | 3.7% |
| i | 18277 | 3.5% |
| Other values (34) | 184734 |
| Distinct | 189 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 63217 |
| Missing (%) | 81.0% |
| Memory size | 609.8 KiB |
| 2018/12/30 00:00:00+00 | |
|---|---|
| 2019/12/12 00:00:00+00 | |
| 2019/09/19 00:00:00+00 | |
| 2017/11/09 00:00:00+00 | |
| 2017/11/08 00:00:00+00 | |
| Other values (184) |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Characters and Unicode
| Total characters | 325930 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 4 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 50 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 2021/06/25 00:00:00+00 |
|---|---|
| 2nd row | 2021/06/03 00:00:00+00 |
| 3rd row | 2021/07/15 00:00:00+00 |
| 4th row | 2021/07/15 00:00:00+00 |
| 5th row | 2021/07/15 00:00:00+00 |
Common Values
| Value | Count | Frequency (%) |
| 2018/12/30 00:00:00+00 | 2771 | 3.6% |
| 2019/12/12 00:00:00+00 | 1848 | 2.4% |
| 2019/09/19 00:00:00+00 | 1586 | 2.0% |
| 2017/11/09 00:00:00+00 | 1111 | 1.4% |
| 2017/11/08 00:00:00+00 | 968 | 1.2% |
| 2021/07/02 00:00:00+00 | 354 | 0.5% |
| 2019/06/07 00:00:00+00 | 267 | 0.3% |
| 2021/05/21 00:00:00+00 | 186 | 0.2% |
| 2018/09/30 00:00:00+00 | 177 | 0.2% |
| 2021/05/17 00:00:00+00 | 168 | 0.2% |
| Other values (179) | 5379 | 6.9% |
| (Missing) | 63217 |
Length
| Value | Count | Frequency (%) |
| 00:00:00+00 | 14815 | |
| 2018/12/30 | 2771 | 9.4% |
| 2019/12/12 | 1848 | 6.2% |
| 2019/09/19 | 1586 | 5.4% |
| 2017/11/09 | 1111 | 3.7% |
| 2017/11/08 | 968 | 3.3% |
| 2021/07/02 | 354 | 1.2% |
| 2019/06/07 | 267 | 0.9% |
| 2021/05/21 | 186 | 0.6% |
| 2018/09/30 | 177 | 0.6% |
| Other values (180) | 5547 | 18.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 148805 | |
| 1 | 29895 | 9.2% |
| / | 29630 | 9.1% |
| : | 29630 | 9.1% |
| 2 | 29181 | 9.0% |
| (space) | 14815 | 4.5% |
| + | 14815 | 4.5% |
| 9 | 8963 | 2.7% |
| 7 | 6006 | 1.8% |
| 8 | 5090 | 1.6% |
| Other values (4) | 9100 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 237040 | |
| Other Punctuation | 59260 | 18.2% |
| Space Separator | 14815 | 4.5% |
| Math Symbol | 14815 | 4.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 148805 | |
| 1 | 29895 | 12.6% |
| 2 | 29181 | 12.3% |
| 9 | 8963 | 3.8% |
| 7 | 6006 | 2.5% |
| 8 | 5090 | 2.1% |
| 3 | 3797 | 1.6% |
| 6 | 2508 | 1.1% |
| 5 | 2286 | 1.0% |
| 4 | 509 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 29630 | |
| : | 29630 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 14815 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 14815 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 325930 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 148805 | |
| 1 | 29895 | 9.2% |
| / | 29630 | 9.1% |
| : | 29630 | 9.1% |
| 2 | 29181 | 9.0% |
| (space) | 14815 | 4.5% |
| + | 14815 | 4.5% |
| 9 | 8963 | 2.7% |
| 7 | 6006 | 1.8% |
| 8 | 5090 | 1.6% |
| Other values (4) | 9100 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 325930 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 148805 | |
| 1 | 29895 | 9.2% |
| / | 29630 | 9.1% |
| : | 29630 | 9.1% |
| 2 | 29181 | 9.0% |
| (space) | 14815 | 4.5% |
| + | 14815 | 4.5% |
| 9 | 8963 | 2.7% |
| 7 | 6006 | 1.8% |
| 8 | 5090 | 1.6% |
| Other values (4) | 9100 | 2.8% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 63207 |
| Missing (%) | 81.0% |
| Memory size | 609.8 KiB |
| CK | 443 |
|---|---|
| MLT | 362 |
| PC | 304 |
| STR | 215 |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.1399663 |
| Min length | 1 |
Characters and Unicode
| Total characters | 16900 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 13414 | 17.2% |
| CK | 443 | 0.6% |
| MLT | 362 | 0.5% |
| PC | 304 | 0.4% |
| STR | 215 | 0.3% |
| CLV | 87 | 0.1% |
| (Missing) | 63207 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ck | 443 | |
| mlt | 362 | |
| pc | 304 | |
| str | 215 | |
| clv | 87 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 13414 | |
| C | 834 | 4.9% |
| T | 577 | 3.4% |
| L | 449 | 2.7% |
| K | 443 | 2.6% |
| M | 362 | 2.1% |
| P | 304 | 1.8% |
| S | 215 | 1.3% |
| R | 215 | 1.3% |
| V | 87 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 13414 | |
| Uppercase Letter | 3486 | 20.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 834 | |
| T | 577 | |
| L | 449 | |
| K | 443 | |
| M | 362 | |
| P | 304 | 8.7% |
| S | 215 | 6.2% |
| R | 215 | 6.2% |
| V | 87 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 13414 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13414 | |
| Latin | 3486 | 20.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 834 | |
| T | 577 | |
| L | 449 | |
| K | 443 | |
| M | 362 | |
| P | 304 | 8.7% |
| S | 215 | 6.2% |
| R | 215 | 6.2% |
| V | 87 | 2.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 13414 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16900 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 13414 | |
| C | 834 | 4.9% |
| T | 577 | 3.4% |
| L | 449 | 2.7% |
| K | 443 | 2.6% |
| M | 362 | 2.1% |
| P | 304 | 1.8% |
| S | 215 | 1.3% |
| R | 215 | 1.3% |
| V | 87 | 0.5% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 63207 |
| Missing (%) | 81.0% |
| Memory size | 609.8 KiB |
| Cooksville BIA | 443 |
|---|---|
| Malton BIA | 362 |
| Port Credit BIA | 304 |
| Streetsville BIA | 215 |
Length
| Max length | 16 |
|---|---|
| Median length | 1 |
| Mean length | 2.177403 |
| Min length | 1 |
Characters and Unicode
| Total characters | 32280 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | |
| 3rd row | |
| 4th row | |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| (space) | 13414 | 17.2% |
| Cooksville BIA | 443 | 0.6% |
| Malton BIA | 362 | 0.5% |
| Port Credit BIA | 304 | 0.4% |
| Streetsville BIA | 215 | 0.3% |
| Clarkson BIA | 87 | 0.1% |
| (Missing) | 63207 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| bia | 1411 | |
| cooksville | 443 | 14.2% |
| malton | 362 | 11.6% |
| port | 304 | 9.7% |
| credit | 304 | 9.7% |
| streetsville | 215 | 6.9% |
| clarkson | 87 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 15129 | |
| l | 1765 | 5.5% |
| o | 1639 | 5.1% |
| A | 1411 | 4.4% |
| B | 1411 | 4.4% |
| I | 1411 | 4.4% |
| t | 1400 | 4.3% |
| e | 1392 | 4.3% |
| i | 962 | 3.0% |
| r | 910 | 2.8% |
| Other values (10) | 4850 | 15.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 15129 | |
| Lowercase Letter | 11203 | |
| Uppercase Letter | 5948 | 18.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1765 | |
| o | 1639 | |
| t | 1400 | |
| e | 1392 | |
| i | 962 | |
| r | 910 | |
| s | 745 | |
| v | 658 | 5.9% |
| k | 530 | 4.7% |
| a | 449 | 4.0% |
| Other values (2) | 753 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1411 | |
| B | 1411 | |
| I | 1411 | |
| C | 834 | |
| M | 362 | 6.1% |
| P | 304 | 5.1% |
| S | 215 | 3.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 15129 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17151 | |
| Common | 15129 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 1765 | |
| o | 1639 | |
| A | 1411 | 8.2% |
| B | 1411 | 8.2% |
| I | 1411 | 8.2% |
| t | 1400 | 8.2% |
| e | 1392 | 8.1% |
| i | 962 | 5.6% |
| r | 910 | 5.3% |
| C | 834 | 4.9% |
| Other values (9) | 4016 |
Common
| Value | Count | Frequency (%) |
| (space) | 15129 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32280 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 15129 | |
| l | 1765 | 5.5% |
| o | 1639 | 5.1% |
| A | 1411 | 4.4% |
| B | 1411 | 4.4% |
| I | 1411 | 4.4% |
| t | 1400 | 4.3% |
| e | 1392 | 4.3% |
| i | 962 | 3.0% |
| r | 910 | 2.8% |
| Other values (10) | 4850 | 15.0% |
RecordID
Real number (ℝ)
| Distinct | 21240 |
|---|---|
| Distinct (%) | 27.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34656.267 |
| Minimum | 2 |
|---|---|
| Maximum | 94424 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 609.8 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2230 |
| Q1 | 9764 |
| median | 19182.5 |
| Q3 | 55026 |
| 95-th percentile | 88915 |
| Maximum | 94424 |
| Range | 94422 |
| Interquartile range (IQR) | 45262 |
Descriptive statistics
| Standard deviation | 29857.312 |
|---|---|
| Coefficient of variation (CV) | 0.86152708 |
| Kurtosis | -0.99364033 |
| Mean | 34656.267 |
| Median Absolute Deviation (MAD) | 16019.5 |
| Skewness | 0.65057392 |
| Sum | 2.7042978 × 10⁹ |
| Variance | 8.9145909 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1055 | 5 | < 0.1% |
| 20882 | 5 | < 0.1% |
| 19580 | 5 | < 0.1% |
| 20871 | 5 | < 0.1% |
| 19831 | 5 | < 0.1% |
| 19332 | 5 | < 0.1% |
| 19583 | 5 | < 0.1% |
| 19832 | 5 | < 0.1% |
| 19584 | 5 | < 0.1% |
| 20872 | 5 | < 0.1% |
| Other values (21230) | 77982 |
| Value | Count | Frequency (%) |
| 2 | 2 | < 0.1% |
| 7 | 5 | |
| 10 | 5 | |
| 12 | 3 | |
| 16 | 5 | |
| 18 | 5 | |
| 20 | 5 | |
| 21 | 5 | |
| 23 | 5 | |
| 26 | 4 |
| Value | Count | Frequency (%) |
| 94424 | 1 | |
| 94423 | 1 | |
| 94419 | 1 | |
| 94371 | 1 | |
| 94321 | 1 | |
| 94319 | 1 | |
| 94318 | 1 | |
| 94317 | 1 | |
| 94313 | 1 | |
| 94293 | 1 |
isnew
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 76.3 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 56546 | |
| True | 21486 | 27.5% |
Closed
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 76.3 KiB |
| False | |
|---|---|
| True | 6415 |
| Value | Count | Frequency (%) |
| False | 71617 | |
| True | 6415 | 8.2% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
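As an illustration of the calculation described above, here is a minimal pure-Python sketch. The function names `average_ranks` and `spearman_rho` are hypothetical and are not taken from the tool that produced this report. Ties are assumed to receive the mean of the ranks they occupy, which is a common convention rather than something the report states.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Covariance of the rank variables over the product of their standard deviations."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    # The 1/n factors of covariance and variance cancel, so raw sums suffice.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one variable is constant")
    return cov / math.sqrt(var_x * var_y)


# Made-up data: a nonlinear but strictly increasing relationship.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
rho = spearman_rho(xs, ys)
```

Because only the ordering of the values matters, the cubic relationship in this made-up example still yields a perfect rank correlation.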
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
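The formula and the invariance property can be checked with a short pure-Python sketch. The function name `pearson_r` and the sample values are illustrative assumptions, not part of the profiled dataset. The rescaling uses positive scale factors, since a negative factor would flip the sign of r.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n factors of covariance and variance cancel, so raw sums suffice.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when one variable is constant")
    return cov / math.sqrt(var_x * var_y)


# Made-up, roughly linear data.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

r_original = pearson_r(xs, ys)
# Shift and rescale each variable separately; r should be unchanged.
r_rescaled = pearson_r([10 * x + 3 for x in xs], [0.5 * y - 7 for y in ys])
```

Up to floating-point rounding, `r_original` and `r_rescaled` agree, which is the location and scale invariance mentioned above.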
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
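The pair-counting definition above corresponds to the variant usually called τ-a. A minimal sketch follows, with the hypothetical function name `kendall_tau_a`. Tied pairs are assumed to count as neither concordant nor discordant. Statistical libraries often default to a tie-adjusted variant such as τ-b, so their results can differ from this sketch when ties are present.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    points = list(zip(xs, ys))
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(points, 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 means a tie in x or y; it counts for neither side
    total_pairs = len(points) * (len(points) - 1) // 2
    return (concordant - discordant) / total_pairs


# Made-up data: 5 concordant pairs and 1 discordant pair out of 6.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
```

In this example only the pair formed by the second and third observations is discordant, so τ = (5 - 1) / 6.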
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available.
| | X | Y | FID | BusinessID | Name | Address | StreetNo | StreetName | BldgNo | UnitNo | PostalCode | Location | Ward | NAICSCode | NAICSCat | NAICSDescr | Phone | Fax | TollFree | EMail | WebAddress | EmplRange | EmplUpdate | Sector_Des | CENT_X | CENT_Y | Year | PIN | Character | CHArea | Modified | BIA_NAME | BIAFulName | RecordID | isnew | Closed |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | -79.689829 | 43.644181 | 1 | 1055 | Golf Trends Inc. | 300 Ambassador Dr | 300 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 414470 | Wholesale | Amusement and Sporting Goods Wholesaler-Distributors | 905-795-8900 | 905-795-8988 | 1-800-668-1101 | lfinch@golftrendsinc.com | www.golftrendsinc.com | 10 to 19 | 2015/10/31 00:00:00+00 | 605668.2538 | 4.833187e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1055 | True | No | |||
| 1 | -79.689419 | 43.644988 | 2 | 1057 | Apex Graphics Inc. | 320 Ambassador Dr | 320 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 323120 | Manufacturing | Support Activities for Printing | 905-795-9575 | 905-795-8775 | prepress@apexgraphics.com | www.apexgraphics.com | 20 to 49 | 2016/10/31 00:00:00+00 | 605699.9370 | 4.833277e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1057 | True | No | ||||
| 2 | -79.689419 | 43.644988 | 3 | 1058 | Sands, John & Associates Limited | 320 Ambassador Dr | 320 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 323120 | Manufacturing | Support Activities for Printing | 905-795-9519 | 905-795-8775 | 50 to 99 | 2015/10/31 00:00:00+00 | 605699.9370 | 4.833277e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1058 | True | No | ||||||
| 3 | -79.689419 | 43.644988 | 4 | 1060 | Printmedia-Tackaberry Times | 320 Ambassador Dr | 320 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 323119 | Manufacturing | Other Printing | 905-564-8121 | 905-564-7395 | info@printmedia.ca | www.printmedia.ca | 1 to 4 | 2015/10/31 00:00:00+00 | 605699.9370 | 4.833277e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1060 | True | No | ||||
| 4 | -79.690664 | 43.645493 | 5 | 1061 | S W R Industries Ltd. | 321 Ambassador Dr | 321 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 417230 | Wholesale | Industrial Machinery, Equipment and Supplies Wholesaler-Distributors | 905-564-8080 | 905-564-5003 | shsieh@swrltd.com | www.swrltd.com | 5 to 9 | 2015/10/31 00:00:00+00 | 605598.6442 | 4.833332e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1061 | True | No | ||||
| 5 | -79.690277 | 43.646372 | 6 | 1063 | Crossdock Freight Solutions | 361 Ambassador Dr | 361 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 488519 | Transportation | Other Freight Transportation Arrangement | 905-670-4937 | 905-670-9475 | customerassist@crossdocksystems.com | www.crossdockfreight.com | 20 to 49 | 2015/10/31 00:00:00+00 | 605628.2838 | 4.833430e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1063 | True | No | ||||
| 6 | -79.689877 | 43.646914 | 7 | 1065 | Green Belting Industries Ltd. | 381 Ambassador Dr | 381 | Ambassador Dr | L5T 2J3 | Gateway EA (East) | 5 | 325510 | Manufacturing | Paint and Coating Manufacturing | 905-564-6712 | 905-564-6709 | 1-800-668-1114 | customerservice@greenbelting.com | www.greenbelting.com | 50 to 99 | 2016/10/31 00:00:00+00 | 605659.5646 | 4.833490e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1065 | True | No | |||
| 7 | -79.634279 | 43.640404 | 8 | 1073 | Dafco Filtration Group Corporation | 5390 Ambler Dr | 5390 | Ambler Dr | B | L4W 1G9 | Northeast EA (West) | 5 | 333413 | Manufacturing | Industrial and Commercial Fan and Blower and Air Purification Equipment Manufacturing | 905-602-1010 | 905-629-1124 | info@dafcofiltrationgroup.com | www.dafco.ca | 50 to 99 | 2016/10/31 00:00:00+00 | 610155.4182 | 4.832840e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1073 | True | No | |||
| 8 | -79.632844 | 43.641337 | 9 | 1074 | Ace Trans Inc. | 5391 Ambler Dr | 5391 | Ambler Dr | 1 | L4W 1H1 | Northeast EA (West) | 5 | 493110 | Transportation | General Warehousing and Storage | 905-625-3000 | 905-625-6049 | info@acetrans.ca | www.acetrans.ca | 1 to 4 | 2016/10/31 00:00:00+00 | 610269.4640 | 4.832945e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1074 | True | No | |||
| 9 | -79.637815 | 43.642638 | 10 | 1077 | Petro Maxx | 5510 Ambler Dr | 5510 | Ambler Dr | 1 to 2 | L4W 2V1 | Northeast EA (West) | 5 | 541490 | Professional | Other Specialized Design Services | 905-206-0040 | blake@petromaxx.ca | www.maxxgroupofcompanies.ca | 20 to 49 | 2015/10/31 00:00:00+00 | 609866.1452 | 4.833083e+06 | 2016 | NaN | NaN | NaN | NaN | NaN | NaN | 1077 | True | No |
| | X | Y | FID | BusinessID | Name | Address | StreetNo | StreetName | BldgNo | UnitNo | PostalCode | Location | Ward | NAICSCode | NAICSCat | NAICSDescr | Phone | Fax | TollFree | EMail | WebAddress | EmplRange | EmplUpdate | Sector_Des | CENT_X | CENT_Y | Year | PIN | Character | CHArea | Modified | BIA_NAME | BIAFulName | RecordID | isnew | Closed |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 78022 | 608544.3664 | 4.840490e+06 | 14816 | 57550 | Advance Car & Truck Rental | 2960 Drew Rd | 2960 | Drew Rd | 149 | L4T 0A5 | NaN | 5 | 532111 | Real Estate | Passenger Car Rental | 905-461-7368 | 905-461-6666 | 1-877-303-7368 | Advancerental@gmail.com | www.advancerental.ca | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2021/06/22 00:00:00+00 | MLT | Malton BIA | 57550 | False | No | |
| 78023 | 608544.3664 | 4.840490e+06 | 14817 | 57551 | Video Palace | 2960 Drew Rd | 2960 | Drew Rd | 150 | L4T 0A5 | NaN | 5 | 532280 | Real Estate | All Other Consumer Goods Rental | 905-678-7878 | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2021/06/02 00:00:00+00 | MLT | Malton BIA | 57551 | False | No | |||||
| 78024 | 608544.3664 | 4.840490e+06 | 14818 | 57552 | Secure Life Insurance Agency Inc. | 2960 Drew Rd | 2960 | Drew Rd | 151 | L4T 0A5 | NaN | 5 | 524112 | Finance | Direct Group Life, Health and Medical Insurance Carriers | 1-800-746-9122 | www.securelifeinsurance.ca | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2018/12/30 00:00:00+00 | MLT | Malton BIA | 57552 | False | No | ||||
| 78025 | 608544.3664 | 4.840490e+06 | 14819 | 57555 | Skillman Flooring | 2960 Drew Rd | 2960 | Drew Rd | 155&157B | L4T 0A5 | NaN | 5 | 442210 | Retail | Floor Covering Stores | 905-676-9111 | 905-676-9113 | skillmanflooring@live.ca | www.skillmanflooring.com | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2019/12/12 00:00:00+00 | MLT | Malton BIA | 57555 | False | No | ||
| 78026 | 608544.3664 | 4.840490e+06 | 14820 | 57557 | Verma Vastar Manufacturing Inc. | 2960 Drew Rd | 2960 | Drew Rd | 160 | L4T 0A5 | NaN | 5 | 315210 | Manufacturing | Cut and Sew Clothing Contracting | 647-669-4545 | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2018/12/30 00:00:00+00 | MLT | Malton BIA | 57557 | False | No | |||||
| 78027 | 608544.3664 | 4.840490e+06 | 14821 | 60142 | JobsForU | 2960 Drew Rd | 2960 | Drew Rd | 156 | L4T 0A5 | NaN | 5 | 561310 | Administrative | Employment Placement Agencies and Executive Search Services | 416-825-4000 | navjot@jobsforu.ca | www.jobsforu.ca | 10 to 19 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2021/07/30 00:00:00+00 | MLT | Malton BIA | 60142 | True | No | |||
| 78028 | 608544.3664 | 4.840490e+06 | 14822 | 60159 | Elite Source Solutions | 2980 Drew Rd | 2980 | Drew Rd | 133 | L4T 0A7 | NaN | 5 | 561310 | Administrative | Employment Placement Agencies and Executive Search Services | 905-598-3542 | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2018/12/30 00:00:00+00 | MLT | Malton BIA | 60159 | True | No | |||||
| 78029 | 608544.3664 | 4.840490e+06 | 14823 | 60160 | Indian Sweet Master | 2980 Drew Rd | 2980 | Drew Rd | 134 | L4T 0A7 | NaN | 5 | 722511 | Accommodation | Full-service restaurants | 905-405-8585 | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2018/12/30 00:00:00+00 | MLT | Malton BIA | 60160 | True | No | |||||
| 78030 | 608544.3664 | 4.840490e+06 | 14824 | 60161 | Mississauga Flooring & Supplies Inc. | 2980 Drew Rd | 2980 | Drew Rd | 135 & 136 | L4T 0A7 | NaN | 5 | 414320 | Wholesale | Floor Covering Wholesaler-Distributors | 905-460-7005 | 1 to 4 | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2021/08/16 00:00:00+00 | MLT | Malton BIA | 60161 | True | No | |||||
| 78031 | 608544.3664 | 4.840490e+06 | 14825 | 60162 | Punjabi Textile Ltd. | 2980 Drew Rd | 2980 | Drew Rd | 132 | L4T 0A7 | NaN | 5 | 414110 | Wholesale | Clothing and Clothing Accessories Wholesaler-Distributors | 905-405-1919 | NaN | NaN | NaN | NaN | NaN | 2021 | 24265600.0 | NaN | Northeast EA (West) | 2018/12/30 00:00:00+00 | MLT | Malton BIA | 60162 | True | No |